GIANT VACANT COMPONENT LEFT BY A RANDOM WALK 
IN A RANDOM d-REGULAR GRAPH 
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Abstract. We study the trajectory of a simple random walk on a d-regular 
graph with c? > 3 and locally tree-like structure as the number n of vertices 
grows. Examples of such graphs include random d-regular graphs and large girth 
expanders. For these graphs, we investigate percolative properties of the set of 
vertices not visited by the walk until time un, where u > is a fixed positive 
\ parameter. We show that this so-called vacant set exhibits a phase transition 

' in u in the following sense: there exists an explicitly computable threshold € 

Q I (0,00) such that, with high probability as n grows, if u < u^, then the largest 

O ' component of the vacant set has a volume of order n, and if u > tt^, then it has 

I a volume of order log n. The critical value coincides with the critical intensity 

■ of a random interlacement process on a c?-regular tree. We also show that the 

, random interlacements model describes the structure of the vacant set in local 

neighbourhoods. 

^ ■ 1. Introduction and main results 

a . 

In this work we consider the simple random walk on a graph G chosen among a 
certain class of finite regular graphs including, for example, typical realizations of 
random regular graphs, or expanders with large girth. The main object of our study 
^ ■ is the complement of the trajectory of the random walk stopped at a time u\G\, for 

. u > 0, the so-called vacant set, and its percolative properties. 

We show that the vacant set undergoes the following phase transition in u: as 
10 ' long as M < u^,, the vacant set has a unique component with volume of order 

CSJ ■ whereas if m > u^, the largest component of the vacant set only has a volume of 

order logjCI, with high probability as the size of G diverges. More importantly, 
we show that the above phase transition corresponds to the phase transition in a 
random interlacement model on a regular tree. In particular, the critical value u^, is 
v~j ■ the same for both models. 

. The random interlacement (on X'^, d > 3) was recently introduced by Sznitman 

5^ \ |Szn09ej to provide a model describing the microscopic structure of the bulk when 

considering the large asymptotics of the disconnection time of the discrete cylinder 
{Z/NZY~^ X Z |DS06j . or percolative properties of the vacant set left by the simple 
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random walk on the torus {'L/N'LY jBSOS] . Later, in |Szn09c| IWinOS] . it was proved 
that the random interlacements indeed locally describe this microscopic structure. 
In |Szn09el [SS09J it was shown that the random interlacement undergoes a phase 
transition for a non-trivial value m^(Z'^) of the parameter u driving its intensity. The 
best bounds on the disconnection time known from |Szn09bj and |Szn09d] involve 
parameters derived from random interlacements and the upcoming work |TW10j 
connects distinct regimes for random interlacements with distinct regimes for the 
vacant set left by random walk on the torus. It can currently not be proved that 
the critical value ti*(Z'^) for random interlacements is itself connected with a critical 
value for the vacant set on the torus or for the disconnection time. 

We provide such a connection in our model. This is possible for the following 
reasons: For the considered graphs G, large neighbourhoods of typical vertices of G 
are isomorphic to a ball in a regular tree, and, as we will show, the corresponding 
local microscopic model is the random interlacement on such a tree. Connected 
components of this interlacement model admit a particularly simple description in 
terms of a branching process, and its critical value is explicitly computable (see 
fll.2p ). giving us a good local control of configurations of the vacant set on G. Good 
expansion properties of G then allow us to extend the local control to a global one. 

We now come to the precise statements of our results. We consider a sequence 
of finite connected graphs Gk = {Vk^Sk) such that the number of vertices in Vk 
tends to infinity as A; — )■ oo. We are principally interested in the case where Gk is 
a sequence of d-regular random graphs, d > 3, or of (i-regular expanders with large 
girth (such as, for example, Lubotzky-Phillips-Sarnak graphs |LPS88j ). As we shall 
show below, these two classes of graphs satisfy the following assumptions, which are 
the only assumptions we need in order to prove our main theorems. We assume that 
for some d > 3, ai G (0, 1), and all k, 

(AO) Gk is (i-regular, that is all its vertices have degree d, and 



We also assume that the spectral gap Xg^ of Gk (we recall the definition in (12. 7p 
below) is uniformly bounded from below by a constant a2 > 0, that is 



Under flAO|) . this final assumption is equivalent to assuming that Gk are expanders, 
see (12. lip . Note that in general flAip does not imply (IA2p , see Remark 11.51 

We consider a continuous-time random walk on Gk- More precisely, we write P 
for the canonical law on the space -D([0, oo), Vk) of cadlag functions from [0, oo) to 
Vk of the continuous-time simple random walk on Gk with i.i.d. mean-one exponen- 
tially distributed waiting times and uniformly distributed starting point. We use 
{Xt)t>o to denote the canonical coordinate process. For a fixed parameter m > not 
depending on k, we define the vacant set as the set of all vertices not visited by the 
random walk until time uuk'. 



(Al) 



for any x G Vk, there is at most one cycle contained in the ball with 
radius ai log^^i Uk centered at x. 



(A2) 



Agj. > ^2 > 0, for all k>l. 



(1.1) 



Vl = {xeVk:x^ Xt, for all < t < uUk}. 
C to denote the largest connected component of V^. 



We use C, 



max 
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The following theorems are the main results of the present paper. The critical 
parameter in the statements coincides with the critical parameter for random 
interlacements on the infinite d-regular tree T^, which, according to |Tei09] . equals 

d{d-l)ln{d-l) 



1.2 



{d-2f 



Theorem 1.1 (subcritical phase). Assume flA0p - flA2p . and fix u > u^. Then for 
every cr > there exist constants K{d, a, u, ai, 02), C{d, a, u, ai, 0:2) < 00, such that 

(1.3) P[\C^J > Klnuk] < for all k>l. 

Theorem 1.2 (supercritical phase). Assume (IA0p - flA2p . and fix u < m^. Then for 
every a > there exist constants p{d, a, u, ai, 02) G (0, 1) andC{d, a, u, ai, 0^2) < 00, 
such that 

(1.4) P[|Caxl > Pnk] > 1 - Cr^,^ for all k > 1. 

For the statement on the uniqueness of the giant component we denote the second 
largest component of by C^^,. 



Theorem 1.3 (supercritical phase-uniqueness) . Assume flA0p - flA2p . andfixu < u^. 
Then for every k > 0, 

(1.5) \imP[\C:j>Knk]=0. 

From the last theorem it follows that there exists a function / satisfying f{n) = 
o{n) and P[|C^^| < f{nk)] — ^ 1. More information on the asymptotics of /(n) could 
be obtained from our techniques. However, they are not sufficient to prove / = 
O(logn), which is the conjectured size of C^^, based on the behaviour of Bernoulli 
percolation. 

Let us now comment on related results. The size of the vacant components left 
by a random walk on a finite graph has so far only been studied by Benjamini and 
Sznitman in [BS08J for Gk given by a (i-dimensional integer torus with large side 
length k and sufficiently large dimension d. In this case, the authors prove that 
the vacant set has a suitably defined unique giant component occupying a non- 
degenerate fraction of the total volume with overwhelming probability, provided 
M > is chosen sufficiently small. Their work does not prove anything, however, 
for the large u regime, let alone any results on a phase transition in u. Our results 
are the first ones to establish such a phase transition for a random walk on a finite 
graph. Moreover, our results provide some indication that a phase transition occurs 
for random walk on the torus as well, and that the critical parameter m^(Z'^) for 
random interlacements on Z"' should play a key role. 

A similar phase transition was proved for Bernoulli percolation on various graphs: 
first by Erdos and Renyi |ER60] on the complete graph, and more recently on large- 
girth expanders in [AB S04] . as well as on many other graphs satisfying a so-called 
triangle condition jBCvdH"'"05] . For our results the paper )ABS04j is the most rel- 
evant, some of our proofs build on techniques introduced there. A very precise 
description of the Bernoulli percolation on random regular graphs was recently ob- 
tained in |NP09l[Pit08] . 
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Let us now comment on the proofs of our results. For most of the arguments, we 
do not work with the law P of the random walk, but with a different measure Q 
on -D([0, oo), Vk). The trajectory of the canonical process X under Q is constructed 
from an i.i.d. sequence of uniformly-started random walk trajectories of 

length L = nl, for 7 < 1, called segments. To create a nearest-neighbour path, the 
endpoint of segment and the starting point of segment are connected using 
a bridge Z^, i e N, which is a random walk bridge of length i = log^n^. Since i 
is much larger than the mixing time of the random walk on Gk, Q provides a very 
good approximation of P, see Lemma 14.11 

The set = Vfc \ Uj<L„„j./(L+£)j RanF*, the so-called vacant set left by segments, 
plays a particular role in our proofs. It is a complement of 'a cloud of independent 
random walk trajectories', similar to the vacant set of a random interlacement. 
Observe that is an enlargement of V^. 

To prove Theorem 11.11 we analyse a breadth-first search algorithm exploring one 
component of the set V^. We show that this algorithm is likely to terminate in no 
more than K In n^, steps. To prove this, we need to control the probability that a (not 
yet explored) vertex y is found to be vacant at a particular step of the algorithm. 
The main difficulty is that, unlike in Bernoulli site percolation models, this event 
is not independent of the past of the algorithm. We will derive an estimate of the 
form (see Proposition 13. 6p 

(1.6) P[y^ Rany^nRanr = 0] ~ /(d, u)(^+^)/(""), 

where A will be the part of already explored by the algorithm. The explicitly 
computable quantity f{d,u) appears in the study of random interlacement on the 
infinite tree at level u |Tei09] , and it equals the probability that a given vertex z 
(different from the root of the tree) is vacant, given its parent in the tree is vacant. 

The estimate (11.61) will imply that the probability of y being vacant given the 
past of the algorithm is well approximated by f{d,u). Since for m > we have 
f{d,u) < l/{d— 1), the considered breadth-first search algorithm can be controlled 
by a sub-critical branching process, yielding Theorem 11.11 

There is an additional difficulty coming from the fact that the estimate (11.61) 
holds only under suitable restrictions on the set A and the vertex y (see (I3.28P ). 
These restrictions are however always satisfied for a large majority of the steps of 
the algorithm, as we will show in Proposition 15.41 

We now comment on the proof of Theorem 1 1.21 This proof consists of the following 
two steps: first we show that for some slightly larger parameter Uk € {u,Ui,), there 
are many components of V^'' having volume at least ra^, for some 5 > 0. Then, 
we use a sprinkling technique, based on the following heuristic idea: we reduce Uk 
to u and prove that with high probability, the mentioned components merge into a 
cluster of size at least pn^, cf. |ABS04j . 

For the first step, we use the fact that V^* can be locally compared with the vacant 
set of random interlacements on a (i-regular tree. This is proved in Proposition 16. 3[ 
which again uses an approximation of type (11.61) . see (16.381) . Since Uk < u^, the 
random interlacement at level Uk is super-critical, yielding the existence of compo- 
nents of volume nf. in Gk- Lemma [6.91 then implies that going from V^*" to V^*' (by 
inserting the bridges Z*) does not destroy these components. 
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Regarding the second step, it is by no means obvious how to perform a sprinkhng 
as mentioned above. Indeed, a simple deletion of the last part X[unk,uknk] of the tra- 
jectory would require us to deal with the distribution of the set ^j.^^.] given V^'', 
which seems difficult. Instead, we perform the sprinkling in the manner natural for 
random interlacements (cf. |Szn09aj ): we remove some segments independently 
at random. 

The deletion of segments, however, disconnects the trajectory of the process. We 
bypass this problem by adding extra bridges before the sprinkling (cf. fl6.52p and 
Lemma 16.91 again) , so that even after the deletion of some segments, we can extract 
a nearest-neighbour trajectory of length at least un, with high probability. 

We then use the expansion properties (cf. (12.111) ) of our graph to show that the 
sprinkling construction merges some of the clusters of size into a giant component 
of size at least puk- 

The proof of the uniqueness, that is of Theorem 11.31 again combines sprinkling 
with the local comparison with random interlacements. Using this comparison and 
the branching process approximation of the random interlacement on the tree, we 
will show that at level Uk there are, with a high probability, only ©(n^) vertices 
contained in vacant clusters of size between In^rifc and n^, for some c G (0, 1), see 
Lemma 17.11 for the exact formulation. This statement is a weaker version of the 
so-called 'absence of components of intermediate size' which is usually proved for 
Bernoulli percolation. 

This will allow us to show that any component of of size at least /tn^, should 
contain at least K,nk/2 vertices x being in vacant components of size at least at 
level Uk- The sprinkling then shows that any two groups of size of such vertices 

are connected in V^, excluding two giant components with a high probability. 

We close this introduction with two remarks concerning our assumptions. 



Remark 1.4. The assumptions flA0l) - flA2l) are designed in order to include two classes 



of regular graphs: expanders with girth larger than clog |Vfc|, and typical realiza- 
tions of a random ci-regular graph. In the random d-regular graph case these as- 
sumptions also help us separate the randomness of the graph from the randomness 
of the walk. 

The fact that the typical realization of the random rf-regular graph satisfies as- 
sumption flAip follows from Lemma 2.1 of [LS08j . where they show it for ai = ^. To 
see that (]A2|) holds one can use the estimate on the second eigenvalue of the adja- 
cency matrix A of the random (i-regular graph of Friedman |Fri08j (or older results, 
e.g. |BS87t IFriQlj . which however only provide estimates for d even and not too 
small). Indeed, in |Fri08j it is shown that this second eigenvalue is 2\Jd — 1 + o(l), 
with a high probability. The largest eigenvalue of this matrix is d. This implies 
2]), since the generator of the random walk is given by ^ — Id. 



Remark 1.5. The assumption (lAll) does not imply (]A2p . This can be seen easily by 
considering two copies G, G' of a large girth expander with n vertices, choosing two 
edges, e = {x, y} of G and e' = {x', y'} of G', erasing e, e' and joining G, G' with 
two new edges and {y,y'}- The new graph is (i-regular. It satisfies (lAip . 

potentially with a slightly different constant than G. However, the new edges create 
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a bottleneck for the random walk, implying that the spectral gap of the new graph 
decreases to zero with the number of vertices n. 

The paper is organised as follows. In Section [21 we set up the notation. In Sec- 
tion [31 we prove an estimate of the form fll.6l) . The piecewise independent measure 
Q is constructed in Section [H Sections [SI [SI and [7] contain the proofs of Theorems II. 
11.21 and ll.3t respectively. 

Acknowledgements. The authors wish to thank Itai Benjamini for proposing 
the study of the vacant set on expanders and are indebted to Alain-Sol Sznitman 
for helpful discussions. This work was started when the third author was at ETH 
Zurich. 



In this section we introduce additional notation and recall some known results 
about random interlacements. 

2.1. Basic notations. Throughout the text c or d denote strictly positive constants 
only depending on and the parameters ai and a2 in assumptions (lAll) and (1A2|) . 
with value changing from place to place. The numbered constants cq, ci, ...are 
fixed and refer to their first appearance in the text. Dependence of constants on 
additional parameters appears in the notation. For instance c-y denotes a positive 
constant depending on 7 and possibly on cti, ^2. 

We write N = {0, 1, ... } for the set of natural numbers and for a G M we write 
[aj for the largest integer smaller or equal to a and define [a] = [aj + 1. In this 
paper we use In x for the natural logarithm and use Id to denote the logarithm with 
base (i — 1, 



For a set A we denote by |y4| its cardinality. 

Recall that we have introduced a sequence of finite connected graphs Gk = (V^, £k) 
in the introduction. We will always omit the subscript k of the sequence of graphs 
Gk and their sizes n^. In particular, we always assume that n is the number of 
vertices of G. For d as in f lAO|) . we will also consider the infinite (i-regular tree, 
denoted 1^ = (Vrf,Erf). 

We now introduce some notation valid for an arbitrary graph G = {V, £). We use 
dist(-, ■) to denote the usual graph distance and write x y, if x,y are neighbours 
in G. We write B{x,r) for the ball centred at x with radius r, B{x,r) = {y & V : 
dist(x, y) < r}. For A G V we define its complement A'^ = V\A, its r-neighbourhood 
B{A,r) = [J^^^B{x,r), and its interior and exterior boundary 

(2.2) diA = {x e A:3y e A^ , X ~ y}, d(,A — {s G A'^ : 3y G v4, x ~ y}. 

We write int(y4) for A \ diA. We define the tree excess of a connected set A (ZV , 
denoted by tyi{A), as the number of edges which can be removed from the subgraph 
of G induced by A while keeping it connected. Equivalently, 



2. Notation 



(2.1) 



\dx = \og^_i X = \tix/ ln((i — 1). 



(2.3) 



tx(A) = \£a\ - 1^1 + 1 
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where Sa stands for the edges of the subgraph induced by A. By a cycle we mean 
a sequence of vertices xi, . . . , Xfc such that xi = Xk and Xj+i ~ Xi for ell 1 < i < k. 
Note that tx(y4) = if and only of there is no cycle in A. 

2.2. Random walk on graphs. We use Px to denote the law of the canonical 
continuous-time simple random walk on G started aX x & V , that is of the Markov 
process with generator given by 

(2.4) A/(x) = 5^(/(y) - f{x))pxy, for / : V ^ M, X e V^, 

where p^y = l/c^x if a; ~ and p^y = otherwise, and denotes the degree of the 
vertex x. We write for Pj. whenever ambiguity would otherwise arise. 

With exception of Lemma 13.11 and Proposition 13. 2^ we will always work with 
regular graphs G, in which case is the same for every vertex x &V . We use Xt 
to denote the canonical process and (J-t)t>o the canonical filtration. We write 
for the restriction of Px to D([0,£], V) and P^y for the law of random walk bridge, 
that is for P^ conditioned on Xn = y. We write Ex, E^, E^y for the corresponding 
expectations. The canonical shifts on D{[0,oo),V) are denoted by Of. The time 
of the n-th jump is denoted by r„, i.e. Tq = and for n > 1, r„ = inf{t > : 
Xt 7^ Xq} o ^t-„_i + '^n-i- The process counting the number of jumps before time 
t is denoted by Nt = sup{A; : Tk < t}. Note that under Px, {Nt)t>o is a Poisson 
process on M+ with intensity 1, but this is not true under P^y. We write X„ for the 

discrete skeleton of the process Xt, that is X„ = X^-^. For < s < t, we use to 
denote denote the set of vertices visited by the random walk between times s and t, 
= {Xr:s<r<t}. 

Given A G V, we denote with and the respective entrance and hitting 
time of A 

(2.5) Ha = inf{t > : G A}, and Ha = Hao Or, + n. 

We write Ha for the discretised entrance time. Ha = Nh^- 

For the remaining notation, we assume that G is a finite connected graph. For 
such G we denote by vr the stationary distribution for the simple random walk on 
G and use vr^. for 7r(x). P stands for the law of the simple random walk started at 
TT and E for the corresponding expectation. Under assumption f lAOp the stationary 
distribution is the uniform distribution. For all real valued functions f,goTiV we 
define the Dirichlet form 

(2.6) V{f,g) = \Y. (/(^) - f{y)){9{x)-g{y))T:xP.y = - ^ A/(x)(?(x)7r.. 

x,ye.V x&G 

The spectral gap of G is given by 

(2.7) Xg = mm{V{f, /) : 7r(f) = 1, 7r(/) = O}. 
From |SC97j . p. 328, it follows that under assumption (lAOp . 

(2.8) sup \Px[Xt = y]- Tiyl < e-^°*, for all t > 0. 

x,yGV 



GIANT COMPONENT LEFT BY RANDOM WALK 



8 



A function : \^ — )■ M is called harmonic on A if Ah{x) = for all x E A. For 
two non-empty disjoint subsets A,C of V we define the equilibrium potential g\ q 
as the unique function harmonic on {A U Cy, satisfying g*\A = 1, g*\c = 0. It is 
well known that 

(2.9) gXcix) = P.[Ha < He], 

(2.10) V{gXc,gXc) = $^n[i^A > 

We define the isoperimetric constant of G as lg = niin{|i9eA|/|y4| : A C V, |v4| < 
1 1^1/2}. If assumption flAOp holds, then Cheeger's inequality ( |SC97[ Lemma 3.3.7]) 
yields clq < Ac < c'lg- The assumption (IA2p then implies the existence of a2 > 



such that 

(2.11) \deA\>a2'\A\, for all A; > 1 and A C 1/ with |A| < |V|/2. 

2.3. Random interlacement. Let us give a brief introduction to random inter- 
lacements. Although we will not directly use any results on random interlacements 
in this paper, random interlacements give a natural interpretation to the key result 
in Section 6. Consider an infinite locally finite graph G = (V, E) for which the simple 
random walk (with law denoted by P^) is transient. According to |Szn09e| ITei09] . 
the interlacement set on G is given by the trace left by a Poisson point process 
of doubly infinite trajectories modulo time-shift in G which visit every point only 
finitely many times. The complement of the interlacement set is called vacant set. 
Although the precise construction of the random interlacements on a graph is deli- 
cate, we give here a characterization of the law that the vacant set induces on 
{0, 1}^. For this, consider a finite set C V and define the capacity of K as 

(2.12) capiK) = J2P^[HK = oo], 

x&K 

with Hk as in (12. 5p . The law Q^j of the indicator function of the vacant set at level 
u is the only measure on {0, 1}"^ such that 

(2.13) Q.u[Wy = 1, for all y E K] = exp{-M ■ cap(ii')}, 

where {Wy}y^Y are the canonical projections from {0, 1}^ to {0, 1}, see (1.1) in 
[Tbi09] . 

3. Conditional probability estimate 

In this section, we derive in Proposition 13.61 an estimate on the probability that in 
a finite time interval [0, T], a random walk does not visit a vertex y in the boundary 
of a set A, given that it does not visit the set A. This estimate will be crucial in 
the analysis of the breadth-first search algorithm exploring the components of the 
vacant set, used in the proof of Theorem II. 11 

We recall first a variational formula for the expected entrance time. 

Lemma 3.1. [AFl Chapter 3, Proposition 41] For a non-empty subset A C V, 
(3.1) (EHa)-' = inf {V{f, f): f:V^R,f = lonA, 7r(/) = 0} . 
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The minimizing function /* in (13. ip is given by 

(3.2) rix) = 1 - 

Using this variational formula, we obtain the following estimate. 

Proposition 3.2. Let A and C he disjoint non-empty subsets of V and let g* = Qac 
be the equilibrium potential (12. 9p and f* the minimising function (13. 2 p . Then 

(3.3) V{g\gnfl - 2snp\r{x)\) < <V{g\g^MC)-'. 

Proof. We prove the right-hand inequality in (13. 3 p first. To this end, we modify 
the function g* such that it becomes admissible for the variational problem (13. ip of 
Lemma [3.11 We define the function ^ on by ^ = {g* — TT{g*))/{l — TT{g*)). Then 
g equals 1 on A and 7r(^) = 0, so we obtain from (13.10 that 

(3.4) E[Ha]-' < V{g,~g) = V{g\g*){l - Trig*))-'. 

Since g* is non-negative, bounded by 1 and non-zero only on C^, we have TT{g*) < 
7r(C"^) and the right-hand inequality of (13.31) follows. 

To prove the left-hand inequality in (13. 3p . observe that the maximizer /* of the 
variational problem (13. ip satisfies /* = 1 on A. Therefore 

(3.5) EIHa]-' = V{f\f*) > mf{V{g,g) : g : V ^ R, g = 1 on A, g = f* on C] . 

Since G is finite, the infimum is attained by a function g which satisfies the given 
boundary conditions on A and C and which is harmonic in {A U Cy. In particular, 
the process {g{Xt^HAuc))t>o ^ P^-martingale for any x E V. From the optional 
stopping theorem, it follows that g{x) = g*{x) + iIj{x), x G V, where the function ip 
is defined by 

(3.6) ^(x) = E4f*{XHa)l{Ha<HA}], for x e V. 
Therefore, 

(3.7) E[Ha]-' > V{g* + V-, + ^) > V{g\ g*) + 2V{g\ ^). 

Since equals on A, A^f* equals on (A U Cy and on int C, and Ag*{x) > for 
all X G diC (indeed, g* is non-negative on V and equal to on diC), we have 

(3.8) V{g\ij) = -J2 A(7^(x)^(x)7r,. > -I^U Yl A^?'(^)^- 

xediC x&diC 

Using again g* equals on diC and observing that A/ = — A(l — /) for any real- 
valued function / on V , as can be directly seen from the definition of A in (12.40 . we 
obtain 

(3.9) P(^^^) > I^U A(l-^7*)(x)(l-^7*(x))7r.. 

x&diC 

Since 1 — g* vanishes on A, while A(l — g*) = Ag* vanishes on {AU Cy and on 
int C, the right-hand side equals — |'?/'|oo^^(l — 1 — (?*) = — |'^|oo^^(5'*, 5'*)- Putting 
together (13.71) and (13. 9p and using (13.61) . we therefore obtain 

E[HAr'>Vig*,g'')il-2\ilj\^)>Vig\g'')il-2snp\f\x)\). 

x&C 
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This yields the left-hand estimate in (13. 3p and completes the proof of Proposition |32J 

□ 

In order to apply the left-hand estimate of (13.31) . a bound on sup^.^^; |/*(a;)| is 
required. We will derive such a bound in Proposition 13.51 below. In its proof we will 
need the following technical lemma. 

Lemma 3.3. Assume flAOp and consider r, s G N and x eV , such that tx(i?(a;, r -|- 
s)) < 1. Then for any y G diB{x, r + s), 

(3.10) Py[HB(x,r) < HBicc,r+sr] < c{d - 1)"^ 

Proof. We write B = B{x,r) and B' = B{x,r + s) and for every vertex z E B' we 
define = dist(a:, z). If tx{B') = 0, then B' is a tree and behaves like a random 
walk on N with drift, which steps left with probability p = 1/d and right otherwise. 

It is a known fact that the probability that a random walk on Z jumping with 
probability p to the right and 1 — p to the left started at x > hits R > x before 
hitting zero equals (see e.g. |Dur96] . Chapter 4, Example 7.1) 

— 1 

(3.11) , where g = (1 — p)/p. 

— 1 

The inequality (I3.10p then follows directly from (13. lip . 

We thus assume that tx{B') = 1. Let us call a vertex z in B' \ {x} exceptional, 
if z does not have d — 1 neighbours z' with r^' > r^. We claim that 

There are at most two exceptional vertices. All of them are at the 

(3.12) same distance (say p) of x and have at most two neighbours z' with. 

rz' < Tz 

To see this, consider an exceptional vertex z G B' . By definition, there is a pair 
Zi^Z2 of neighbours of z with r.^^^rz^ < r^. By considering geodesic paths from Zi 
and Z2 to x, one can extract a cycle in B{x,rz) containing z and exactly two of 
its neighbours Zi, Z2. By construction, this cycle has at most two vertices which 
maximize the distance to x. One of them is z. Second might be zx or ^2, in which 
case this vertex has the same distance to a; as 2; and is also exceptional. To show 
that there cannot be another exceptional vertex other than z (and potentially one 
of Zi, ^2), we suppose that there is one, we call it z' . By the same reasoning we 
can extract a cycle in B' containing z' with z' maximizing the distance to x. This 
cycle thus must be different from the one containing z. This is impossible since 
tx(i?') = 1. Similarly, if z has three or more neighbours Zi with r^^ < r^, then every 
pair of them can be used to extract a cycle, all of them being different. This is again 
in contradiction with tx(i?') = 1. With this we conclude (I3.12p . 

Let Yt = dist{B , XtAHg^ji^gr^c) ■ We compare Y with a continuous-time birth-death 
process Ut on {0, . . . , s -|- 1} given by the following transition rates 



(3.13) Pi,i+i = l-Pi 



2)/d, a i = p — r, 

l)/d, ifzG{l,...,s}\{p-r}, 



and such that the states and s -|- 1 are absorbing. More precisely, using (I3.12p . we 
can couple Y (under law Py) with U (started from s) in such way that Ut < Yj. for 
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every t > 0. This implies that 
(3.14) 



PylHs < H(^B')'=] is smaller or equal to the probability 



that U hits before s + 1, given that Uq = s. 

The last probability will now be estimated using a standard birth-death process 
computation. Let f{i) be the probability that U started at i hits and set h{i) = 
f{i - 1) - /(i), i G {1, . . . s + 1}. Clearly /(O) = 1, f{s + 1) = and the strong 
Markov property on the time of the first jump implies that h{i)pi^i-i = h{i + l)pi^i+i, 
i E {1, . . . , s}. Fixing h{s + 1) = 7 we use the above facts to get 

(3.15) 1 = /(O) > h{l) = 7 ■ P^'^---P^'^+^ = 2(rf _ _ 2)2. 

Pl^Q. . .Ps,s-1 2 

Moreover, conditioned on Uq = s, the probability that U hits zero before s + 1 is 
/(s) = 7, i.e. Prob[f/ hits before s + l\Uo = s] = f{s) = 7. Putting this together 
with fl3.14p and f l3.15p the proof of Lemma 13.31 is finished. □ 

We now apply the last lemma to estimate the probability that the random walk, 
started outside of the larger one of two concentric balls visits the small ball before 
time T > 0. 



Lemma 3.4. Assume that G satisfies f lAO|) and consider T > 0, r, s E N and x E V 
such that tx{B{x, r + s)) < 1. Then, for some c, c' > 0, 

(3.16) Py[HB{x,r) <T]< cT{d - 1)^' + e""'^ for all y E B{x, r + s)". 

Proof. As in the previous proof we write B = B{x,r), B' = B{x,r + s). From an 
exponential upper bound on the probability that a Poisson random variable with 
expectation T is larger than 2T, we have 

(3.17) PylHs <T]< PylHs < 2T] + e"^'^, 

where Hb is the entrance time for the discrete-time walk defined below (12. 5p . 

On the way from y to x (as in the lemma), the simple random walk must visit 
some vertex z E diB' . After reaching such vertex, it either hits B without exiting B' 
or it exits B' . The probability of the first event is bounded from above by c((i — 1)~^, 
see Lemma 13.31 When the second event occurs, the simple random walk must again 
pass through d^B' in order to visit x. At this point we can repeat the previous 
reasoning. However, before time 2T we can repeat this procedure at most 2T times, 
since we are considering a discrete-time walk. A union bound then implies 

(3.18) Py[HB < 2T] < 2Tc(rf - 1)"" 



and Lemma 13.41 follows by renaming constants. □ 

Finally, we prove the proposition that will allow us to use the left-hand side of 
the estimate (13. 3p on E[Ha\~^, derived in the beginning of this section. 



Proposition 3.5. Let G = {V,£) be a graph on n vertices satisfying flAOp . (]A2p 

and let A C V, s E (0, log^_i n] fl N such that \A\ < n/2 and tx{B{x,s)) < 1 for 
every x E A. Then 

(3.19) sup ^rlr-l <c\A\id~ir\og'n. 

y€V:dist{y,A)>s ^[-nA\ 
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Proof of Proposition \3.5[ In essence, the proof is an application of the estimate (12. 8p . 
which shows that the distribution of the random walk on G at time T = A^^ log^ n 
is close to uniform. From Lemma 13. 4[ we know that it is unlikely that the random 
walk started at y reaches a point x in A before time T and this will yield (I3.19p . 
We shall require the following rough bounds: 

Tl 

(3.20) — — - < E[Ha] < sup Ez[Ha\ < cnlogn, for some constant c > 0. 

4|^| 2GG 

The first inequality in (I3.20p follows from the right-hand estimate of (13. 3p with C 
chosen as A'^, (I2.10p . and our assumption that \A\ < n/2. To prove the last inequality 
in (I3.20p . observe that for t = 2logn/a2, assumption ( ]A2p and (12. 8 p imply 

(3.21) inf P,[Ha < 21ogn/a2] > inf P.[X2iog„/a2 e A] > {2n)-^ 

By the simple Markov property applied at integer multiples of t, it follows that Ha 
is stochastically dominated by t times a geometrically distributed random variable 
with success probability l/2n and (I3.20p readily follows. 

Let y be chosen as in the statement and let us first consider the expectation of 
Ha starting from Xt- From (12. 8p and our crude estimate (l3.2Up . we obtain, for any 

zeV, 

\E,[ExAHa]] - E[Ha]\ < J2 l^^l^T = -^1 - ^^'\Ez'[Ha] 

(3 22) ""'^^ 

< ^ e" log n < n^e~ 

z'eV 

We now apply this inequality to find an upper bound on Ey[HA]- Since Ha < 
T + Ha o 6t, the simple Markov property applied at time T and (I3.22p imply that 
for any z ^ V, 

(3.23) E,[Ha]<T + EAExAHa]] < T + n'e-'''^" ^ + E[Ha]. 

With the first inequality in (I3.20p . we deduce that 

,3.4, §^-.(--^V-,M.^. 

which is ample for one side of (I3.19p . To prove the other half of (13.191) . choose y as 
in the statement and apply the simple Markov property at time T to infer that 

Ey[HA] > Ey[l[HA>T}ExT[HA]] = Ey[Exj,[HA]] — Ey[l{HA<T}ExT[HA]] 
13:2211 2 
(3 25) > E[Ha] - n\-'°^ " - Py\HA < T] snpE,[HA] 

^ ' z&V 



> E[Ha] - 2n^e-'°^ " - Py[HA < T](T + E[Ha]). 

Applying (I3.16P to the probability on the right-hand side and rearranging, we find 
that 

(3.26) ^^-l>-c\A\id-ir\og'n, 

which together with (I3.24p completes the proof of Proposition 13.51 □ 
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A 



Figure 1. Proper J^A{y,r) (gray points) on a 3-regular graph with 
r = 5. 

We now analyse the distribution of the hitting time of a point y conditioned on 
the event that a certain set A is vacant. This estimate will be helpful for the analysis 
of the breadth-first search algorithm used in Theorem 11.11 

For any non-empty connected set A C V^, r > 1, and y G d^A we define 



(3.27) 



Ta^V-, r) = {-2 G -B(v4, r) \ a : ;z is connected to y in B{A, r) \A). 



Observe that y G J-A(?/,r). In the breadth-first search algorithm to be introduced 
in Section O the set J-'^(|/,r) can be viewed as the 'future of y seen from A . We 
say that J^Aiy.r) is proper when (see Figure [1]) 



(3.28) 



(i) tx(j-^(|/,r)) =0, 

(ii) y has a unique neighbour y in A, 

(iii) for any vertex y' ^ A\y, every path from y to y' 
leaves B{A, r)\A before reaching y'. 



Proposition 3.6. Let s G [2, (ai A l)ldn), A C V, A ^ with \B{A,s)\ < ^Jn, 
and y G dgA, such that J^A^y, s) is proper. Then, for any T > 1, 



(3.29) 



In P[HAu{y}>T\HA>T] + 



T {d-2) 



n d{d — 1) 



c\A\ (T\A\\Yi^n 



n \ [d 

P[HAU{y} > T] 



1 . 



Proof. We set 

(3.30) FliT)=P[HAuiy}>T\HA>T] p^Ha>T] ' 

and use results of |AB93] to estimate both numerator and denominator. Namely, 

by |AB93] (1) and Theorem 3, for uny A(ZV,t> 0, 

(3.31) 

^ ^ exp f - ^ ) < P[Ha > t] < (1 - 7r(A)) exp ^ 



^gEuaHa 



EoaHa 



E^aHa 
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Here is the quasi- stationary distribution for the random walk killed on hitting 
A. We will only need its following properties, see |AB93] Lemma 2 and Corollary 4, 

1 - tt{A) ^ EH A 



(3.32) 



< 



TT{x)p^y 1 - 7l{A) 



< EqaHa < EH A + A, 



G 



Observe that the left-hand side is bounded from below by n/(2|A|) > c^/n for A as 
in the statement. 



Writing A = AU {y}, a = ua and a 



a 



A' 



and applying f l3.3ip for A as well as 



for A to bound the conditional expectation fl3.30p . we obtain after rearranging and 
taking logarithm 

■ n{A) 



1 - 



(3.33) 



In 



<\nFliT)- 



T 



+ 



En, Ha EnH 



< In 



1 



1 - 



\gEcHa 

Using fl3.32p and the observation following it, we see that E^Ha > c^/n. Therefore, 
by expanding the function ln(l/(l — x)) around zero and using fl3.32p again, we 
obtain that the right most term in (13.330 is bounded from above by c|y4|/n, which 
can be included in the error term of (I3.29p . A similar reasoning implies that the 
left-most term in (I3.33P is bounded from below by — c|A|/n, which can again be 
accommodated into the error of (13.290 . 

The inequalities in (I3.32p further imply that < EaHA—EHA < and therefore 



T 



T 



cT 



Ul 



where in the last inequality we used (I3.32p again. The right-hand term in the 
last display is again smaller than the error in (I3.29p . since 
assumption s < \ \dn. 

Finally, we use Proposition 13.21 to approximate 1/EHa and 1/EH^. To this end 
we introduce C = B{A, sY and we set g = g^^, see (12. 9p . Then by Proposition 13.21 
we have that 



(3.35) 



T 



EH, 



TV{g,g) 



<TVig,g){niC) 



1 + 2 sup 

zee 



EyHf 



EH, 



- 1 



By (I2.10p . V{g,g) < ti^A). From the assumption B{A,s) < y/n it follows that 
7r(C)"^ — 1 < cn^^^'^ < c{d — 1)~*. Finally, the Proposition 13.51 implies that the 
supremum in (I3.35P is bounded by c|y4|((i — l)~'^ln^?2. Hence, the right-hand side 
of (I3.35P is smaller than the error term in (I3.29p . An analogous computation proves 
that T/EH^ is approximated by TV{g,g), where g = g"^^. A little bit of care is 

only needed when applying Proposition 13. 5^ since dist(y4, C) = s — 1. 

We have thus proved that InF^(T) is well approximated by T(V{g,g) — V{g,g)^ 
up to the error on the right-hand side of (13.290 . We now estimate this expression. 
Let y be the unique neighbour of y in A. By (12.101) . 

T 



(3.36) T{V{g,g)-V{~grg)) = - J^PJ^^ > Hc]~J2^^^^A > He 

zeA zeA 

Now we use our assumption that J^A{y,s) is proper. Due to (13.280 (111). for all 
z G A\{y}, there is no path from z toy using only vertices in B{A, r)\A. Therefore, 
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for such z Pz[Ha > He] = Pz[Ha > He], and f l336|) equals 

(3.37) ^ [Py[HA > He] - Py[HA > He] - PyW^ > He]] . 
Conditioning the first two terms on X,-^, since J-'A^y, s) is proper, we get 

(3.38) lil^y^^^ > - ^y^^^ > ^^]}- 

Since by assumption tx(J^4(?/, s)) = 0, these probabihties can be computed using 
the formula (13.111) for the random walk with drift. Setting g = l/((i — 1) we have 

(3.39) Py[Hj^>He] = '^~\\~J_^ , and P^[Ha > He] = 
Inserting this into (13.381) we obtain that 



(3.40) 



T{Vig,g)-Vi~g,~g)) 



T{d-2f 



cT 

< 



nd(d — 1) 

This completes the proof, since the error is smaller than the right-hand side of 

We now use the same techniques to control the hitting time distribution of a point 
with tree-like neighbourhood. 

Lemma 3.7. Let y E V be such that tx{B{y, s)) = for some s G [l,aildn). 
Then, for any T > 1, 

n T(d-2) c/Tln^n 

(3.41) |,„p|^^„>r| + -i^| <-(— + ! 

Proof. The proof follows the same lines as the previous one with A = {y} and 
without the conditioning, which is equivalent to controlling the numerator of (I3.30p 
only. The same reasoning as before implies that, for g = g* ^^^y \^'^H[Hy > 
T] - TV{g,g)\ is smaller than the right-hand side of (ICT]) . Using (CTI]) . flA]) . 
with q = l/{d — 1) again, we get 

(3.42) V{g, g) = n-'Py[Hy > Hs^yM = = + 0{id - 1)^), 

nl — n{d — 1) 

which finishes the proof. □ 

4. PlECEWISE INDEPENDENT MEASURE 

We now make another preparative step in order to prove our main results. In later 
sections, it will be convenient to split the random walk trajectory X[o,un] into smaller 
pieces and to treat pieces that are sufficiently distant in time as being independent of 
one another. Although this kind of independence does not hold under the random 
walk measure P"", we will in this section construct a new measure on the space 
of trajectories with the desired independence properties. In Lemma 14. H we then 
estimate the error we make when replacing P"" by this new measure. 

For the construction, we choose real parameters 

(4.1) L = n^, £=(lnn)2. 
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where 7 G (0, 1) will be fixed later. We consider an abstract probability space {Q, P) 
(expectation denoted by E) on which we define a sequence of i.i.d. random variables 
y*, i > 0, with values in D{[0, L],V) and the marginal distribution (defined in 
Section 122]) • We set a^, bi to be the start- and the end-point of Y\ ai = Yq, hi = ¥[. 
On the same space fl we further define a sequence of random variables Z\i > 0, with 
values in D([0,£], V). Given a^, bi, i > 0, the random variables are independent, 
conditionally independent of the sequence (Y^), and the random variable has the 
random-walk bridge distribution P^^ We call the F^'s segments and Z"s bridges. 

We now concatenate the F^'s and Z"s to obtain an element of D{[0, 00], V). More 
precisely, we define the concatenation mapping X from Q to D[[0, 00), V) as follows: 
For t > 0, let it e N and s G [0, L + i) he given by t = it{L + i) + St. Then, for 
t > 0, 



(4.2) Xt{Y',Z',Y\Z\...) 



Y::, iiO<st<L, 
Zl_^, iiL<St<L 



The mapping X induces a new probability measure Q = P o on D{[0,oo),V). 
We use Q"*, s > 0, to denote the restriction of Q to D{[0, s],V). The measure Q 
will be used to approximate P later. We control this approximation now. 

Lemma 4.1. For every fixed u > 0, the measures P"" and Q"" are absolutely 
continuous and there exist constants c, c' > depending only on 0:2 such that 



(4.3) 



dP"^ 



Proof. Let m' > -u be the smallest number such that u'n is an integer multiple of 
(L + i), and set m = u'n/{L + £) e N. Let further A be an arbitrary measurable 
subset of -D([0, u'n], V). Since P'"" and Q"" are the restrictions of P" and Q" " to 
D{[0, un], V), it is sufficient to prove the lemma with u replaced by u'. To this end, 
we set t2k = k{L + i), t2k+i = k{L + £) + L for G {0, . . . , m}, and write 

(4.4) P"'"[A] = J2 ^"'"[^1^*. =Xi,0<t< 2m]P"'"[Xt^ = Xi,0 < t < 2m]. 
By the Markov property 



m—1 

(4.5) P'^'^lXt^ =x„0<t<2m]= 7r{xo) H P^JX^ = X2u+i]Pi,,jX, = X2U+2]. 

k=0 

The construction of the measure Q implies that 

(4.6) Q"'"[A|Xt^ =Xi,0<t<2m] = P"'"[A|Xt, =Xi,0<i<2m], 

m—l 

(4.7) g"'"[Xt^ = x„0<t<2m] = 7t{xo) J] P^X^ = X2k+Mx2k+2). 

k=0 

Comparing (14.51) and (14.71) . it remains to control the ratio P^iXg = y]/7c{y). However, 
by (lA2p and (12. Sp . this ratio is bounded by 1 +ne~°'^^. Hence, (14. 4p is bounded from 
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above by 
(4.8) 

(1 + ne-"'^)"" Q"'"[A|Xi^ = Xi,0<i< 2m]Q"'"[Xt^ = Xi,0<i< 2m] 

< Q"'"[A](l + Me-'='"'"), 

where, in the last inequahty, we changed the constants to accommodate the terms 
polynomial in n. A lower bound can be obtained analogously. We have thus shown 

(4.9) Q"'"[A](1 - ne-"^°'") < P"'"[A] < Q"'"[A](1 + we-'^'"^'"). 

It immediately follows that P"'"- and Q"'"" are absolutely continuous. Moreover, the 
fact that f l4.9p holds for any event A in J^u'n yields directly the estimate fl4.3p . □ 

We end this section with a simple lemma which controls the number of jumps 
performed by segments and bridges, which will be useful several times later (see 
Subsection 12.21 for the definition of the jump process (A^i)t>o)- 

Lemma 4.2. 

(4.10) P[2-V<iVL<2n^] > l-e""^"". 
For any x,y E V, 

(4.11) P^y [Ne > In^ n] < c exp{-c' In^ n}. 

Proof. Under the measure P, the random variable A^^^, has Poisson distribution with 
parameter L = ri^ . Hence, f l4.10p follows by a standard large deviation argument. 

In order to prove (14. lip , note first that A'"^ is not necessarily a Poisson random 
variable under P^y, due to the conditioning on the position of the endpoint. However, 
using (12. 8p for the last inequality, we have 

sup t.^ [e '\ - sup h^[e '\Xi> -y\< — _ 

(4 12) "^''^^^ "^'^^^ inix,?;ey -f1r[-^£ - 

^ ■r'pTf , <2n-exp{cl..^,} 

m.i^^y(.v Px[Xi = y\ 

for n larger than some d . The exponential Chebyshev inequality then implies claim 
(ii) for such n. Adjusting the constants to make the claim valid for all n finishes the 
proof. □ 

5. Sub-critical regime 

In this section we prove Theorem II. 1^ which states that if u > m^, then the 
maximal connected component Cmax of is typically of size O(lnn). We will do it 
by analysing a breadth-first-search (BPS) algorithm which explores the component 
Cx of the vacant set containing a given vertex x. This algorithm is similar to the 
one used in the Bernoulli percolation case, but has some important modifications 
due to the dependence in our model. 

We start the proof by reducing the complexity of the problem. We set, as in 
Section m L = n"', i = In^n, with 7 G (0, 1). Due to Lemma [4.11 it is sufficient to 
show that Theorem 11.11 holds with with P"" replaced by Q"" = P o A'^^|£)([o,un],y)- 
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Since we are looking for an upper bound on the vacant set, we can disregard 
the bridges Z* in the concatenation X (cf. (14. 2p ). More precisely, we set m = 
[un/{L + and we observe that P-a.s. the vacant set 

(5.1) V: = V\{X,iiY^),iZ')):te[0,un]} 
is a subset of the vacant set left by segments, V", 

(5.2) V" := 1/ \ Ui<^ Ran Y\ where Ran Y' = Yf^^^ . 

Let Cmax = C^ax Cx = bc the largest connected component, and the compo- 
nent containing x of V = V", respectively. Then the inequality 

(5.3) P[|C^ax| > i^logn] < nP[|C,| > Klogn] 

implies that to prove Theorem 11.11 it is sufficient to show the following proposition. 

Proposition 5.1. Let G be a connected graph on n vertices satisfying the as- 
sumptions flAOp . flAip . flA2p and let u > U-^. Then for every a > there exist 
1 < c, K < oo not depending on n (but depending on d, u, a\, such that 

(5.4) P[|C^| > indn] < cn-"-^ 

Proof. We prove this proposition by analysing the following BFS algorithm. During 
the run of the algorithm, all vertices in V are in one of four states: explored- vacant, 
explored-occupied, not-explored or in-queue. The set of vertices with the state in- 
queue is organised as a queue Q, that is it is ordered, the vertices are added to its 
end and removed from its beginning. The vertices in Q wait to get explored. 

Further, the state of any index i G {0, . . . , m — 1} of a segment Yi can be either free 
or tied. Note here that these states do not change the behaviour of the algorithm, 
but will be used for its analysis. Their meaning will be easier to understand as we 
get to Lemma [5.21 

When the algorithm starts, all vertices different from x are in not-explored state, 
X is in-queue, Q = (x), and all indices z G {0, . . . , m — 1} are free. 

At the step k of the algorithm, the first vertex y of the queue Q is removed from 
Q. If y ^ V, then the state of y is changed to explored-occupied, and all indices i of 
segments intersecting y (i.e. {i : y E F*}) become tied. On the other hand, if ?/ G V, 
then the state of y changes to explored-vacant and all non-explored neighbours of y 
in G are placed at the end of Q, in other words, their state changes to in-queue. To 
avoid ambiguity, we suppose that V is equipped with an ordering and the neighbours 
of y are added to Q according to this ordering. 

The algorithm stops if the queue Q is empty, or if the set of explored-vacant 
vertices has more than Kldn vertices. Since this set is subset of Cx by construction, 
we know that only in the second case we have \Cx\ > Kldn. Hence, in order to 
establish Proposition 15.11 one only needs to show that 

, X there are K,c> such that, with probability at least 1 — cn^'^^^, 
^ ' ' the algorithm finishes because the queue gets empty. 

To analyse the algorithm we need more notation. Throughout this section we fix 

(5.6) r = (71dldn) V2. 
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We let EVfc (EO^, IQ^) stand for the (random) set of vertices in the explored- vacant 
(explored-occupied, in-queue, respectively) state before the beginning of the fc-th 
step of the algorithm. Similarly, Ffc,Tfc denote the sets of free and tied indices at 
this moment. We set E^ = EO^ U EV^ and let i/k be the vertex being explored in the 
k-th step. In particular yi = x, EVi = EOi = Ti = 0. Let fcmax be the step when 
the algorithm finishes, 

(5.7) fcmax = min{/c : |IQfc| = or |EVfe| > K\dn}. 

Observe that by construction EOfc C deEVk and thus E^ C (EV^ U de'EMk)- Since G 
is (i-regular, and since exactly one vertex is explored at every step, this implies 

(5.8) \^k\ = k — 1 < dKldn, for all k < fcmax- 

We further define a filtration {Ak)k>i) where the cr-algebra Ak contains all informa- 
tion discovered by the algorithm before the fc-th step, that is 

(5.9) Ak = a(EV,-, EO^-, IQ^, F,-, T,- : j < A fc^ax)- 

Observe that, due to the ordering that we use while adding vertices to Q, the random 
variables yi, . . . , y^-i are cr(EOfc, EVfc)-measurable. 

To prove Proposition EH] we analyse the process recording the length of the queue, 
qk = \\Qk\, I < k < fcmax- We use = qk+i - qk, < k < /^max, to denote the 
size of its jumps. Since the graph G is (i-regular, in step k, at most d — 1 {d if 
k = 1) vertices are added to Q, and every time exactly one vertex is removed from 
it. Therefore, ri G { — 1, d — 1} and G {—1, . . . ,d — 2}, for k > 2. 

Roughly speaking, to prove (15. 5p we will show that the process (q'fc)fc<fc,„ax has a 
'down-drift'. For this, we need a lower bound on the probability that = —1 given 
the past Ak of the algorithm. Since = —1, whenever yk ^ V", we have, on the 
event {k < fcmax}, 

^ ^ P[rk = -l\Ak]>P[yk^V\Ak] = P[ykeUi<mRanY'\Ak] 

(5.10) . ., -, 

The reason why we have made the distinction between the free and tied indices 
is made clear in the short lemma below. 

Lemma 5.2. Let k > 1 and k^, = Afcmax- Then, conditioned on the a-algehra Ak^, 
the collection {Y^)i^^^^ is i.i.d. with marginal distribution P^[-\H^^^ > L]. 



Proof of Lemma \57M Let H = (EVj, EOj, \Qj, Fj, Tj)j<kt be the whole history of the 
algorithm until the time /c*. We use H = (yj,Oj,Qj, Fj,Tj)j^k to denote possible 
outcomes of H, here is a positive integer. Since the F^'s have marginal distribution 
P^, it suffices to prove that for any H such that P[H = H] > and any measurable 
subsets Ai of D{[0,L],V), 

(5.11) P[ni<,F-{Y'eA,},H = H] = fl[P[Y'eA,\RanY'nE-k = (a]\p[H = H], 

where Ej = Oj U Vj. Let us now analyse the event H = in detail. Let yj = 
Ej^i \ Ej be the vertex explored in the j-th step in the history H. If this vertex is 
vacant, that is yj G Vj+i \ Vj, then we know that yj ^ nj<mRany . On the other 
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hand, if it is occupied, that is yj G Oj+i \ Oj, then necessarily yj ^ nigi?_^j RanF*, 
yj G UjgTj^i RanF* and yj G rii^Xj+iXTj RanF*. Therefore, the event H = H can be 
written as 

RanFi) fl Q < {yj ^ ni^F.+i RanF*) 

(5.12) i<fc:VS+i\vs-^0 i<fc:O,+i\o,^0 

n (y, G U,6T,+i Rany^ n (y^ G nieT,^,\T, Rany^} 

Collecting the events containing y* with i G -F^, using Fj D for all j < k, this 
can be rearranged as 

(5.13) Pi (Rany^ n F^ = 0) n f{H, {RanY' : i G T^)), 

where / is some event depending only on H and Rany* with i &Tj,. Inserting this 
expression for H = H into (15. lip and using the independence of y*'s under P, the 
lemma follows. □ 

Lemma [5.21 implies that, on the event {k < fcmax}; 

P[rk = -l\Ak] > 1 - P[yfc ^ fl Rany^lA 

(5.14) ieFfe 

= l-(^[^E,ute.}>^I^E,>i^])"'='. 

To bound fl5.14p . we will use Proposition 13.61 with A = E^, y = yk E deE^ and 
s = r, see below (15. 5p . We first check its assumptions: Inequality (15. 8p implies that 
for n > ck, B{Ek,r) < \Ek\{d — lY < ^Jn] k >2 implies 7^ and is connected 
by construction. For k >2, let Vk = {J^Ekiyk,r) is proper}, see (I3.28p . 

Take > such that U-^{1 + EuY < u. Since r = 7 Id Id n and L = rC , the error 
term in (I3.29P is smaller than Ci^-n'^"^/ Inn which is much smaller than the leading 
term. Hence, for n > Cu,a2,K, on the event Vk, we have by Proposition 13.61 that on 

{k <^ fcmax}; 

(5.15) > LIff,. > L] < exp{-^|;^-^L_l_}. 

We further define Qk = {\^k\ > u^n^^'^{l +£u)^}. Observe that both Vk and Qk 
are ^fc-measurable. Inserting (I5.15P into (I5.14p and using the definition (11.21) of m*, 
we get 



(5.16) 



Ph^-l|AI>[l-exp{ } 



2 

1 



d 



d-l 

for some 5„ > on {/c < /cmax}, provided n > Cu,^- 

To proceed we need to control the occurrence of {Qk H VkY- Observe that, for 

k ^ fcmax) Gk ^ ^fcmax- 

Lemma 5.3. For every a > there exists c = c„^u,K,'y such that P[Qkms^J — cn^'^^^ . 
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Proof. We set M„ = max^gy \{i < m : 3 z}\ and we define the event Q = {M„ < 
Id^n}. Observe that on Q we have by (15. 8p 

(5.17) |Ffc| = m- \Tk\ >m - \EOk\ n > m - dKld^n, 

which is larger than U-),n^~'^{l + 6uY for n > Ck,u- Therefore, Ql^^^ ^ 0'^ for n > Ck,u- 
It remains to bound P[^'^]. First, note that, 

P[z G RanF^] = P[H, < L] < P[Nl > n^^] + P[ uj"7^ {Y; = z}] 

Using the bound above and an exponential Chebyshev-type inequality, we obtain 
that P[\{i < m : Y' 3 z}\ > Id^ n] < c„,.^e-^'^'". Summing over z we get P[Q^] < 
Cu,a,'yn~'^~^ and the lemma follows. □ 

We further control the number of steps for which Vk does not hold. This is the 
content of the following proposition whose proof is postponed to the end of the 
section. 

Proposition 5.4. There are at most crK^ steps of the algorithm for which 
occurs. 

To show (15. 5p we now couple the process q with another process {q[)k>i which is 
a random walk with drift such that r[ = q'f^_^_^ ~ I'k ^ d — 2} and P[r^ = —1] = 
(1 + (see 05.161) ). This implies that E[ry < 5'^ for a constant 5'^ < 0. 

The coupling is constructed so that q' can be used as an upper bound for q. This 
is done as follows. Let A; G {2, . . . , fcmax — 1}- On {Qk H VkY we take independent 
of q. On Qk n Vk we require that t'^ = {d — 2} whenever > 0. This is possible 
because P[rk > 0] < P[r[, = {d - 2}] on Qk n Vk, due to f l5J6|) . For = 1 and 
k > kmux, 'f'k is independent of q. 

As initial condition we take q'^ = crK'^{d — 2) + d. Intuitively speaking, this gives 
q' a security zone, for the steps in which Vk does not hold. With this setting, the 
Lemma [5.31 and Proposition 15.41 imply that > qk for all k < fcmax with probability 
larger or equal to 1 — cn^'^^^. 

We can finally show (15. 5p . The probability that the algorithm finishes due to 
I EVfe^^^ I > ii' Id n is bounded from above by 

P [ niin qk > O] < en-'-'' + P[ min > O] < en-'-'' + P[g^idn > 0] < cn,-'-^. 

k<Kldn k<Kldn 

for K, d large enough, by an easy large deviation estimate for the random walk g', 
which has a negative drift. This finishes the proof of Proposition 15.11 and conse- 
quently of Theorem 11.11 □ 



It remains to show Proposition 15.41 The next lemma is the key step in its proof. 
It controls the tree excess of small (non-necessarily ball-like) sets. 

Lemma 5.5. Let G = (y,£) with \V\ = n satisfy fEOj) and f lXT]) . Then for all 
K > 1 and all connected sets A G V such that \A\ < nldn 

(5.19) tx{A) <CK^ =: a{K). 
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Proof. Let Ga = {A, Sa) be the subgraph of G induced by A and let s = ai \dn. We 
call a cycle in Ga short if it has no more than 2s edges, otherwise we call it long. 

Roughly speaking, the strategy to prove the lemma will be to erase edges belonging 
to short cycles, then to bound the amount of edges that could be still removed after 
that. 

Fix a short cycle C and let Oc = {y E A : C C Bc^iy, s)}. Since A is connected, 
either Oc = A or \0c\ > s. Further, since flAip holds for G, it holds also for Ga- 
Therefore, if C, C' are two distinct short cycles, then Oc and Oc' are disjoint. This 
implies that if Oc = A for a short cycle C, then there is only one short cycle in 
A and that in any case, there are at most \A\/s = n/ai short cycles in Ga- From 
every of these disjoint short cycles we can erase one edge and Ga remains connected. 
Hence we erase at most n/ai edges in this step. 

After this removal, we obtain a graph = {A, £') with girth larger than 2s. 
Recall from (12.31) that tx(y4) = \£a\ ~ \A\ + 1. Hence, since Ga and are both 
connected and was obtained by removing no more than 1 + k/oi edges of Ga, 

(5.20) tx(A) < l + /€/ai + tx(G^). 

To estimate the last term on the right-hand side, consider the set Z) = {(a;,y) G 
A^ : distG^(x,?/) > s}. Let 7 = (xq, xi, . . . , Xm, a^o) be a (necessarily long) cycle 
in G^. By removing the edge {xo^Xm}, the size of the set D increases at least by 
(| — 1)^. Indeed, before removing this edge any pair (xj,^^), for < i < | — 1, 
0<m — j<| — 1 was not in D. However, after removing {xo,Xm}i such a pair 
must be in D, since otherwise there would be a path in G^ connecting Xi and Xj, not 
passing through the edge {xo,Xm} and having length at most s, thus there would 
be a short cycle in G'^ which is not possible. Since the size of D is at most it 
is not possible to remove more than |74p/(| — 1)^ edges from G^ while keeping it 
connected. Hence, 

UP 

(5.21) tx(G'4) < 7 — -. r^-, which, for n > c, is smaller or equal to IQk^ /ai'. 

(s/2 - 1)^ 

The claims and fl5:2T]l imply that tx(A) < 1 + (k/ch) + (16kV«i^) < ck^ for 

n > d . Lemma [5.51 now follows by possibly adjusting the constants. □ 



Proof of Proposition \5.4\ The algorithm defined in the beginning of the proof of 



Proposition 15.11 induces a natural random tree structure T = (E^^^^, £t)- Namely, 
{y, z} G £t if and only if z was added to the queue during the exploration of y or 
vice- versa. By (15. 8p we have |Efc^^^| < dKldn =: nldn. 

We now finish the proof of Proposition 15.41 in three lemmas which respectively 
control the number of yk's for which (i),(iii), or (ii) of (I3.28P do not hold. It is worth 
to remark that the arguments in these lemmas are purely deterministic and do not 
depend on the fact that T results from the previous BFS algorithm. 

We start dealing with the condition (i) of (I3.28p . Recall the definition of ^(k) in 
Lemma [5.51 and that r = (7 Id Id n) V 2. 

Lemma 5.6. Let B = {yk : k < /cmax (^'^d tx{J-'Ef.{yk,r)) ^ 0}. Then, for large 
enough c, \B\ < 2ra(2K,) for all n > c. 
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Proof. For i < /cmax, we define inductively a sequence 7^ of paths in V as follows. If 
Hi ^ B or if Ui belongs to IJ^-^- Ran7j, then 7^ = 0. li yi e B and yi ^ IJ^.^- Ran7j, 
then there is a cycle in J^EXVii^) definition of B and this cycle is unique by 
Assumption f lAip (note that tx(J-'E.(?/i, r)) < 1 since r < aildra for n > c). In this 
case, we define 7^ as the unique path in J^eXUii from to this cycle, concatenated 
with the self-avoiding path exploring the whole cycle in one of the two directions. 

Set A = {1 < i < /cmax : 7i 7^ 0} and observe that | Ran7j| < 2r, for all i G A, 
and B C IJ^g^ Ran7j, hence \B\ < 2r\A\. 

It remains to show that \A\ < a (2k). Assume the opposite. Let Aq be any subset 
of A with q;(2k) + 1 elements, set R = E^^^^ U IJieAo ^^^li- Obviously, 

(5.22) \R\ < \Ek^J + 2r{a{2K) + 1) < 2Kldn, for n > c. 
We claim that 

(5.23) tx{R) > a{2K) + 1, 

which together with (15.221) contradicts Lemma 1531 and hence proves Lemma 1531 The 
estimate (I5.23P will follow if we can show that for all i G Aq, we have tx(i?j_i) < 
tx{Ri), where 

(5.24) Ro = 0, and Ri = {yi, . . . ,y,} U [j Ran 7,, 1 < 2 < | | . 

If Ri-i n Ran7j = 0, then this last claim is immediate, because 7^ then contains an 
additional cycle disjoint from Ri-i. Suppose now that Ri-i fl Ran7j 7^ 0. We will 
now find a cycle in Ri using an edge that is not already present in the graph induced 
by Ri-i- This will again imply that tx(_Rj_i) < tx(i?j), because by removing such 
an edge the graph remains connected and still has the graph induced by Ri-i as a 
subgraph. To find the cycle, note that yi ^ Ri-i, because i can be in Aq only if 
7j 7^ 0, which by construction can only happen if Ran7j fl {yi} = for all j < i. 
Let yi be the parent of yi in the tree T. Then by construction of T, yi G Ri-i- We 
now exhibit a cycle in Ri as follows: we start at yi and connect yi to yi. We then 
follow the path 7^ from yi to the first vertex xq belonging to Ri-i. Since Ri-i is 
connected, we can close our cycle by concatenating our path with a non-intersecting 
path from xq to yi using only vertices in Ri-i and therefore not intersecting the 
previously constructed path from yi to Xq. We have thus found a cycle in Ri using 
the edge {yi, yi}. Since yi ^ Ri-i, this edge is not present in Ri-i and it again follows 
that tx(_Rj_i) < tx{Ri). We have therefore proved fl5.23p and thereby completed 
the proof of Lemma 15.61 □ 

We now treat condition (iii) of f l3.28p . 

Lemma 5.7. Let B = {yk : there is path in B{Ek,r) \ from y^ to Ek\yk}, where 
yk is the parent of yk in T. Then, forn > c, \B\ < 2ra(2/t). 

Proof. The proof is analogous to the previous one. We define a sequence 7^ of paths 
in V as follows: If yi E B and yi ^ |J^.^^ Ran7j, then let 7^ be a self-avoiding path 
connecting yi to Ej = {yi, . . . ,yi--i}, whose first vertex after yi is in J-'E,(?/i,T) and 
whose length is at most 2r. Provided n is large, such a path exists for any yi G B, 
because tx(_B(t/j,r)) < 1 (cf. (lAip . (15. 6p ). Otherwise, we set % = 0. Defining A, 
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Aq, R and Ri as in the proof of Lemma [5.61 we can again prove (15.221) and fl5.23|l . 
The argument is the same as the one used below fl5.23p . except that we now only 
have to consider the case Ri-i fl Ran7j ^ 0. □ 

Finally, we treat condition (ii) of f l3.28p . 

Lemma 5.8. With B = {yt : yk is neighbour in G of two vertices in E^}, \B\ < 
a{2K). 

Proof. In this case we can remove the edges between yi and Ej which are not in 
£t from the subgraph of G induced by E^^^^^ while keeping it connected. Since 
|Efc^^^| < /tldn, Lemma [5.51 implies the result. □ 

Proposition 15.41 follows easily from last three lemmas. □ 

6. Super-critical regime 

In this section we prove Theorem IL2I stating the existence of a giant component 
for u smaller than m*. Since the proof is rather lengthy we first briefly outline its 
strategy. The strategy is inspired by the methods used for Bernoulli percolation. It 
has two major parts: First, we consider a modification of the piecewise independent 
measure and for such modification, we prove the existence of a sufficient amount 
of mesoscopic clusters even under a slightly increased value u„ > u. Second, by 
decreasing u„ back to the original value m, we prove that these clusters are connected 
by sprinkling. Both these parts are however rather non-trivial, due to the presence 
of the dependence. 

To construct the mesoscopic clusters, we first show in Section [6^ that the vacant 
set left by segments (see (15. 2p and (I6.14p below) on G locally resembles the vacant set 
of random interlacement on the d-regular tree = (Vd,Erf) (Proposition 16.30 . The 
behaviour of the random interlacement on T^^ is well known |Tei09j and its clusters 
can be controlled in terms of a particular branching process. This branching process 
will be super-critical for m's considered in this section. 

The control by the branching process allows us to construct a sufficient amount 
of mesoscopic clusters for the vacant set left by segments. Since we are looking 
for a lower bound on the vacant set, we however cannot ignore the bridges as in 
the previous section. In Section 16.31 we show that the mesoscopic clusters of the 
vacant set left by segments are robust and the addition of the bridges to the picture 
typically does not destroy them, see Proposition 16.61 

Finally, in Section 16.41 we use a sprinkling well adapted to our model to prove 
Theorem 11.21 As discussed in the introduction, in this sprinkling we erase randomly 
some segments, possibly in the middle of the trajectory. This can possibly disconnect 
the trajectory. Therefore, to be able to extract a nearest-neighbour path in the 
end, we must add many additional bridges to the picture; the robustness proven in 
Section 16.31 must take them in consideration. 

6.1. Preliminaries. We establish first the following technical consequence of as- 
sumption flAip which will be needed later in this section. 
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Lemma 6.1. Let G = {V,S) be a graph satisfying assumptions (IAO|) . flAl|) . Set 
R = [ai Id raj to be the radius of flAl|) and r = R — A, for some A G {1, 2, . . . }. 
Then 

(6.1) \{x:tx{B{x,r)) = 0}\ > {1 - {d - 1)-^)\V\. 
Proof. Let us consider the sets 

(6.2) A = {x: tx{B{x, r)) = 1} and A = {x : tx{B{x, R)) = 1}. 

We first study the structure of the graph G restricted to A. For every point x E A, 
there is exactly one cycle in B{x,R). This cycle should contain at most 2R + 1 
vertices, otherwise it cannot be contained there. If C C is such a cycle, we define 
£c = diamC = [|C|/2J and Nc{s) = {x : G C B{x, s)} for s < R. It is easy to see 
that dMl) implies Nc{R) = B{G,R- ic). 

We now prove the following claim: the subgraph of G induced by Nc{R) is com- 
posed by the cycle G with disjoint trees rooted at its vertices with depth R — Cc- 
Indeed, the graphs attached to every y E G should be trees because otherwise flAll) 
cannot hold. To see that they must be disjoint, suppose that they are not, that is 
there are two points y,z E G such that y and z are connected in Nc{R) \ G. This 
connection must be shorter than 2{R — ic) + 1. Joining this connection with the 
shortest connection of y and z in G, which is shorter than ic, we obtain a cycle 
different from G of length at most 2R — + 1, which is contained in B{y, R). This, 
however, contradicts f lAip . 

The claim proved in the above paragraph implies that 

(6.3) \Nc{R)\ = \G\ + (d - 2)|C|(1 + ■ ■ • + (d - l)«-^^-i) = |C|(d - i f-'^, 
Since Nc{r) is either empty or has a similar structure as Nc{R) 

(6.4) \Nc{R) \ Nc{r)\ > \G\{d - l)^-^^((d - 1)^ - 1) > |iVc(r)|((rf - 1)^ - 1). 

By our assumptions, the set A can be written as a disjoint union A = ufLiNci{R) 
for some M G N and cycles Gi, . . . , Gm- Similarly A = Ui^uNdij) for some U C 
{1, . . . , M} which contains indices of cycles shorter than 2r + 1. Therefore, using 
flOj) and flOll . 



(6.5) 



ieu ieu 
>Y.\NcAr)\{d-l)'' = id-l)'^\A\. 



ieu 

Hence \A\ < \V\/{d- 1)^ and thus = \{x : tx{B{x,r)) = 0}| > (1 - (d - 

i)-^)\v\. □ 

We now collect some notation used in the proof of Theorem 11.21 In what follows, 
we write V = for the set of vertices of the tree and denote by a its root. In 
order to describe the clusters of random interlacement on the tree we define the 
function / : V — )■ M as 



(6.6) f{z) 



if 2 7^ o. 



d(d-l) 

M for 2; 
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We let Q* stand for the law on {0, 1}^ which associates to the vertices z eY inde- 
pendent Bernoulli random variables with success probability e~'^'^^^\ The following 
result of |Tei09j provides the connection between this Bernoulli percolation and ran- 
dom interlacement on T^. This result will not be used in this paper, but is quoted 
here in order to provide the natural interpretation of the model that we have just 
introduced. 

Theorem 6.2 ( |Tei09j .Theorem 5.1 and (5.7)). The connected component Co C V 
containing the root o has the same law under characterised by f l2.13p as under 

Note that, under the law Q* the cluster Co C V can be regarded as a branching 
process, where the ancestor in generation is born with probability exp {— ""^zf}' 
and with binomial offspring distribution with parameters d — 1 and Pu, where 

u{d-2y 



(6.7) p. = exp, 

In order to deal with this branching process it is useful to define the expected number 
of offsprings as well as its logarithm in base d — 1: 

uid — 2)^ u 

(6.8) mu = [d- l)pu, and = Idm^ = 1 - -773 — ttttI 7T = 1 • 

d[d — 1) m[d — 1) Ui, 

Observe that for u < u^, we have > 1 + c„ and Vu € (c„, 1), for a constant c„ > 0. 

For u G (0,u^), it will be convenient to fix a small e = e{u) > such that the 
slightly increased intensity u(l + e) satisfies 

(6.9) M(l + e)< — - — and 7 + -pe < 



4 4 2(m + mJ' 
which by (16. 8p implies that 

(6.10) - = 1 - ^ Q + ^e) > 0. 

Finally, we define 

6.11 p = < , and 7 = 7 m = — — — < — . 

^ ^ 100 100 ' n J 2 2 

When u > (which we allow in Section 16. 2p we only require a weaker condition 

(6.12) < 7 < /3 = ai/100 < 1/100. 

We recall from Section H] the segments with length L = rP constructed on the 
probability space (fi, P). We define 

(6.13) M„ = \un/{L + £)] , for m > 0. 

We consider the vacant set left by segments V" = V \ Uo<i<Mu RanF*, and the 
corresponding random configuration ^ {0, 1}^ defined by 

(6.14) = ly. = l{r \Uo<KAf„RanF^}. 
Given a configuration rj G {0, 1}^ or G {0, l}'^, let 

, Cxirj) be the connected component of supp r) := {y eV : ri{y) = 1} 

containing the vertex x, 
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and Cma.x{v) be the largest such component. 
For any fixed vertex y E V, we define 

(6.16) By = B{y,p\dn), B'^ = B{y,5/3\dn) C V. 
We also set 

(6.17) B = 5(o,/31dn), B' = fi(o, 5/3 Id n) C V. 
If y has a tree-like neighbourhood of radius 5/3 Id n, 

(6.18) there is a graph isomorphism (p : By ^ M' such that (j){y) = o. 

In order to make the formulas less complicated, we will mostly identify the vertices 
of G and of T^^ linked by this isomorphism and omit (p from the notation. The vertex 
y is always given by the context. In particular, for z E By we define f{z) = /(0(z)). 

6.2. Approximation by random interlacements. With all notation in place, we 
can now approach the proof of Theorem 11.21 In this section we show that, provided 
tx^B'y) = 0, the component of the set HBy = supp ^„ fl By containing the centre 
y of By can, up to a small error, be controlled from above and from below by the 
branching process introduced in (16. 6p and below. Note that for the next proposition 
it is not necessary to assume that u < u^. 

Proposition 6.3. Assume (lAOp and (lA2p . and suppose that tx{B'y) = 0. Then 
for any m > 0, e G (0, 1), we can construct random sets and C"^^^*^-* C V 

distributed as Co under Q^^^^^^j and Q*(^_^-) such that 



C"(i+^)nBcc,(iB, -e^) cc"(i-), 

for all u e {u{l - I), u(l + |)) 



> 1 _ P r)~^^ 



Remark 6.4. Proposition 16.31 can also be interpreted as a control of the component 
of V" n By by random interlacement on T^. Indeed, due to Theorem 16.21 the sets 
(^u{i±e) j^g^yg same distribution under the Bernoulli measure Q*(i_|_£-) as under the 
random interlacement measure Qu(i±e). 

Proof. Throughout this proof, we write B, B' rather than 5^, By. Our strategy 
resembles the proof of Theorem 5.1 in |Tei09j . We first poissonise the number 
of trajectories entering in the definition of the configurations E,u for u G (u(l — 
e/2),u(l + e/2)), see (16.141) . To this end we introduce two independent Poisson 
random variables 11+ and n_ defined on (f2, P) with parameters un^~'^{l + ^) and 
un^~"'{l — ^), independent of all previously introduced random variables. We are 
going to compare with the configurations ^4. defined by 

(6.19) ^_ = l{r\Ui<n_Rany^} and ^+ = l{V \ Ui<n+ RanF^}. 
Clearly, by a large deviation argument, since M„ = urv^^^'il + o(l)), 
P[e+ <iu< for all u e (uil - e/2),u{l + e/2))] 

> P[n- < Mu(i-e/2) < Muii+e/2) < H+J > 1 - c^,„,, exp{-cn^-^}. 
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Figure 2. The trace left by some pieces in B and the set D^. The 
black and white circles represent respectively the values and 1 for 
the variables ^±{z) defined in fl6.22p . 



Next, we will dominate ^± from above and from below by a collection of i.i.d. 
Bernoulli random variables. For every z & B, we define the set Dz as the set of 
descendants of z in B, that is 

Dz = {z' & B : any path from z' to y meets either z or some vertex in -B^}, 

see FigureEl Consider the following disjoint subsets of -D([0, L], V), cf. |Tei09] (5.3), 

Wz = {Y e D{[0,L],V) : z G RanFnfi C Dz}, z e B, 



(6.21) 



W = [Y e D([0,L], V) : RanFnE ^ 0} \ U ^z- 

zeB 



In particular, all trajectories in W must enter B, then exit B' and enter B again, 
see Figure [2J We define the random configurations ^_ G {0, 1}-^ on {Q, P) by 

(6.22) K J \ T z, J, 

e+(z) = 1{Y' i Wz, Vz < n+}, zeB, 

see Figure [2] again. Since the sets Wz are disjoint for distinct z's, the variables i+{z) 
will be independent for distinct z's due to the Poissonian character of 11+ (the same 
will also hold for ^-{x)), see Lemma 16.51 below. We further consider the random 
variable 

(6.23) Z = !{¥' ^w,yt< n+}. 
Observe that 

(6.24) on the event {Z = 0}, C^(1b ■ = Cy(l-) and Cy{lB ■ C+) = C,(e+). 

The following lemma shows that the laws of C,- and ^+ on {0,1}^ are comparable 
with the laws Qu{i-3e/4) '^«(i+3e/4) '^^ Bernoulli percolation introduced above, 
restricted to {0, l}'^ (which by assumption can be identified with {0, 1}^). 
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Lemma 6.5. For ± denoting either + or —, the events {{C,±{z) = l})^^^ are 
independent and satisfy 



(6.25) 



P[^+{z) = 1] - e-"(i+3^/4)/(^) 

P[i_{z) = 1] _ e-"(l-3e/4)/{.) 



< c^,„n and 

< c n^'^/'^ 



Before we prove this lemma, we complete the proof of Proposition 16.31 For z G V 

let 

Pt = e""(i+^)/(^), qt = P[Ur\^)) = 1], 
p; = e-"(i-)/(^), q; = P[i_{<p-\z)) = 1]. 
Then Lemma [6.51 implies that for n>c'^^^, 

(6.27) pt <qt < C < Pz^ for all zeM. 

We now construct the sets C'^i'^^'^^ as stated in the proposition by adding to 
our probability space (fi, P) a collection j^gv of independent Bernoulli- 

distributed random variables which will fine tune the values qf to match the p^'s: 

• For every z eY\M, the parameters of and U~ are p+ and p^. 

• For z eM, U^' and have parameters pt /it (Pz ~ 1z)/i^ ~ 1z)- 
We then define C"(^±') by 

(6.28) = Co[{iU^) A Ut)lzeM + f/+l.ev\B),,v) ' 

(6.29) r(i-) = Co[{ii-{^) V t/7)l,eB + f/-l,gv\B),,^). 
Note that we then have 

(6.30) C"(^+^) C Cj,(|+) C Cj,(|_) C 

Since the variables ^± and f/J*^ are all independent (cf. Lemma [6.5p . it is elementary 
to check that the laws of C^i^'^^') agree with those of Co under Q*(i_|_^') for large n. 
Moreover, we have by fl6.24p that on the event {Z = 0} fl {.^+ < ■Cfi ^ 'C-}? 

(6.31) C"(l + ^) n 1 C Cy{lB ■ e+) C Cy{lB ■ ^U) C Cy{lB " ^) C C^^"^) H B. 

Since we already know the bound f l6.20p . it thus only remains to prove that 

(6.32) P[Z ^0]< CuU-^^. 

If Z ^ 0, there is an i < n+ such that G W. Since tx(fi') = 0, if Y' G W, then 
there exist times ti < t2 < ts such that Fj* G -B, Fj* ^ B' and again Fj* G B, see 
fl6.2ip . Using the strong Markov property we thus get 

(6.33) P[Y' eW] = P^[W]<P^[HB<L] sup P^[Hb < L]. 

wev\B' 

Note that by stationarity of the random walk with respect to the uniform distribu- 
tion, 

(6.34) P'^IRb < L] < E'^^Y^liXk e B}^ = E'^INlW/u = L\B\/n. 

k=0 
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Using Lemma 13.41 for the second term on the right-hand side of fl6.33p , we hence 
obtain 

(6.35) P[Y' eW]< c^n^+l^-^cn^-^^ + e""'"^) < c^n^^'~'^'^~\ 

Since Z has Poisson distribution with parameter un^^"' {l+3e /4:)P^[W], using 7 < /3, 

(6.36) P[Z^0] = 1- exp{-u{l + 3e/4)n^'W^[W]} < CuU-^^. 

Up to Lemma 16. 5[ this completes the proof of Proposition 16.31 □ 

Proof of Lemma \6. 51 Since the sets Wz-, z E B, are mutually disjoint and n_ is 
Poisson distributed, independent of the Y^'s, the random variables G Wz '■ i < 
n_}|, z E B are independent Poisson random variables with parameters un^~'^{l — 
3e/4)P[Y^ e Wz]. In particular, since ^.{z) = 1{\{Y' e W^ : i < n_}| = 0}, the 
events {{^_{z) = l})zeB are indeed independent. Moreover, we have 

(6.37) P[t{z) = 1] = P[\{Y' eWz-.i <U_}\=0]= e-™^-"(i-3-/4)P[^°eM/.]_ 

The above arguments apply also to ^+ and yield the analogous claims. Since the 
function is Lipschitz with constant 1 on [0, 00), we see that the left hand sides 
of f l6.25p are bounded by Cu|r2^^^P'^[iy2] — f{z)\. It is therefore sufficient to prove 
that 

(6.38) \n^-"'P^[Wz] - f{z)\ < c^n-^/\ for any z e B. 

Note the relation of this approximation with (11. 6p . 

Conditioning on the number of jumps Nl made by X in the time interval [0, L] 
and using independence of Nl and the discrete skeleton X, we have 

(6.39) P^[iyj = 5^P[iVL = r]P[z G {Xo, . . . n 5 C DJ. 

r>0 

Let us fix any r such that [2~-'^?2^J < r < [2n'^J and throughout the rest of this 
proof write = B \ D^. Summing over all possible times k when X first visits z 
and applying the simple Markov property, we obtain 

(6.40) 

P[z G {Xo, ...,Xr}riB cDz]= J2 P[Ham^} = k,Xk = z]Pz[Ha. > r - k], 

0<k<r 

where we are using the convention that H0 = 00, which occurs in the last probability 
when z = y, in which case Ay = B \ Dy = 0. Using reversibility of X with respect 
to the uniform distribution on the first probability in the product, we deduce that 

(6.41) P[z G {Xo, ...,Xr}nB cDz] = ^ ^ ^4^1uM > > r - k]. 

0<k<r 

We now claim that the following estimates hold uniformly for all n"'^'^ < k < 

r — rCl"^: 

(6.42) sup Pz\HB<k\<c^n-^^&nd.^\xpPz\Ey\B>>k\<c^{yo^n)n-^l'^. 

z'£V\B' z'eB' 

Indeed, the first estimate follows from Lemma 13.41 and the choice of 7 < /3 in 
(16.111) . while the second estimate in (16.421) follows from the Chebyshev inequality and 
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the bound Ez>[Hv\b'] < Cj^logn, which is an elementary estimate on the expected 
amount of time it takes a one-dimensional biased random walk to reach the level 
5/3 Idn. 

For any k as above, it follows from f l6.42p and the strong Markov property applied 
at time Hv\b' that 

(6.43) |n[^Xu{.} >k]- n[^Xu{.} > Hv\B']\ < c,{n-'^ + n-^'^). 

We now relate the second probability on the left-hand side to the escape probabil- 
ity to infinity from BUj^} for the random walk on the tree T^. By the strong Markov 
property applied at time Hv\b', we have (identifying z and with corresponding 
objects on T^) 



inf^/gvw pI'''[Hm = oo] 



(6.44) PJ1^X,|.^ = oo] < P.[^Xu{.} > Hv\B'] < 

By another elementary estimate on the biased random walk (dist(-B, Xri))n>0) we 
have 



(6.45) inf PyiH^ = oo] > 1 - cn 



-4/3 



Collecting the above estimates we obtain that for any n'^^'^ < k < r — n'l'^ ^ 



(6-46) n[i^XuW > - n^1^Xu{.} = ^\ 



-7/3 



and the same computations with -f^Xu{z} replaced by Ha^ show that 
(6.47) [Ha^ >r-k]- P]^ [Ha^ = oo] 



-7/3 



With estimates on one- dimensional random walk, we can compute the escape 
probabilities for random walk on the infinite tree explicitly. Indeed, by applying 
the simple Markov property at time 1, and then computing the probability that a 
nearest-neighbour biased random walk on the integers does not return to when 
started at 1, we obtain 

(6.48) Pl^[HU., = oo] = " - ^ ' ' ^ ^' 



d-l' 



and similarly, by the convention that = oo, 

(6.49) PJI^A. = oo] = / ^' 'I'^y^ 

[1, iiz = y. 

Note that in both cases, the product of the two probabilities just computed equals 
f{z), cf. (16. 6p . Inserting the estimates fl6.46p and fl6.47p into f l6.4ip . we therefore 
infer that for any r such that 2~^n'^ < t < 2n'^, 

(6.50) \P[z G {Xo,...,Xr}r\B C D,] - rn-^ f{z)\ < c^n~^+(27/3)_ 
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Using this estimate and the large deviation bound on Nl from fl4.10p in f l6.39p . we 
obtain that 

(6.51) \P^{W,] - Ln-^f{z)\ < c^n-^+^^''/'^\ 

hence fl6.38p . This completes the proof of Lemma 1^31 and thus of Proposition l6.3[ □ 

6.3. Existence of mesoscopic components. We now use the results of the last 
subsection to establish the existence of many mesoscopic components in the (ap- 
propriately modified) vacant set. In order to state the precise result, we need, as 
we have discussed before, to introduce the long-range bridges that are necessary to 
perform the sprinkling. 

Recall from Section H] that Oj = and bi = ¥[ denote the start- and end-point 
of the segment Y^, i > 1. On the same probability space {Q, P), we now define a 
family of D([0, y)-valued random variables Z^'\ i G N, j G {1, . . . , [InnJ}, with 
law characterized by the following: 

, , conditionally on a^, bi, the Z^'-^'s are independent, 

^ ■ ^ independent of the y*'s, and have distribution P^^g^.^.- 

We call Z^'-^'s the long-range bridges. Given the y"s and Z^'-^'s as above, we denote 
by G {0, 1}^ the indicator function of the vacant set left by them, i.e. 

(6.53) C = l{l^\U,<M„,,<in„{Rany^URanZ^'^}}. 

From definitions of ^„ and it follows that < iu- 

We now show that the configuration has many mesoscopic components. More 
precisely, the following proposition shows that with high probability, a constant 
proportion of vertices is contained in components of with size of order 

Proposition 6.6. For < m < u^, there exist constants c\, C2 depending on a\, a2 
and u, such that 

(6.54) P[|{x G V" : \C^{0\ > Ciri''"(i+^)^}| > Ciu] > 1 - C2 exp{-cln^ n}. 



The proof of the proposition has two parts. First, in Lemma 16. 8[ we establish 
a similar result for the configuration C,u defined in fl6.14p as the indicator of the 
complement of the segments. We then show that many of them survive adding the 
long-range bridges which will prove the Proposition 16.61 

6.3.1. Robust mesoscopic components for ^u- In order to ensure that adding the 
long-range bridges does not destroy the components of of size cin''"(i+")'^, we 
should make them more robust. We therefore impose the following more restrictive 
conditions on the components to be found. 

Definition 6.7. Let t] be a configuration in {0, 1}^ and set for I G N 

(6.55) Cl{r]) = {y E diB(x, I) : y is connected to x by a path in Cx{r]) fl B{x, I)}. 

(Note that CKt]) is contained in, but not necessarily equal to Cx{ri) fl diB{x,l).) 
Given a positive parameter h, a given site x E V is said to be /i-proper under the 
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configuration 7] , if tx{B{x, 3 P Id n)) = and the following two conditions hold (recall 



(6.56) 



r.;|Cfi'^"(r^)|>/.mf(\% = /^n''"(-^)^ 

(ti) |Cj(r/)| < m('(f2;) for all y G B{x,f3\dn), I G [lo,h] n N, 
where Iq = [10 lnlnn/f„(i_e)] , h = [(3\dn\. 
The next lemma proves the existence of many proper sites. 



Lemma 6.8. For 13, e as in ( 16. lip .( 1619]) there exist constants Csi^u), C4{u) and 
c(ai, u) such that 

(6.57) P \{x E V : X is Cs-proper under > c^n > 1 — cexp{— In'^ n}. 

Proof. We first show with Proposition 16.31 and estimates on branching processes 
that the expected number of proper vertices is of order n and then we show that 
this number is concentrated around its expectation. Throughout this proof, we 
abbreviate m„(i+e) and mu(i-e) by m+ and m_. 

Consider x E V such that tx(_B(x, ai Idn)) = 0. We estimate the probabihty 
that X satisfies condition f l6.56p (i) with /i > to be chosen. Since 5/3 < ai, using 
Proposition O with m(1 + e) < (m + (cf. ([S3])), 



(6.58) 



-2/3 



For u{l + e) < {u + Ui,)/2, the branching process induced by Q*^;^,^^-) is supercritical. 
Hence, [AN72[ Theorem 2, p. 9] implies that for h < chosen small enough, the 
first term on the right-hand side of fl6.58p is bounded by 1 — c„ for n > c'^. Therefore, 
letting h be some strictly positive constant C3 < c„. 



(6.59) 



supP[|Cf"i"(e„)|<C3m^"^"] <1 

n>Cu 



We now treat condition fl6.56p (ii). Since m„ > 1 + Cu, we can use |Ath94| Theorem 
4] to find a 6u E (0, c^) such that (here, E* denotes Q*-expectation) 



(6.60) 



Hu = sup sup E* 

n l>0 



exp <^ Ou 



\ci 



< 00. 



We claim that for Iq and h as in (16.560 . and e as in (16. 9p . 



(6.61) 



ICil > m^y^^'' for some / > In 



< c„exp{-c^ln^ra}. 



Indeed, by the exponential Chebyshev inequality, the left-hand side of (I6.6ip can be 
bounded from above by 

(6.62) 



l=lo 



exp<j^,(i_,)^ 



> exp |^„m 
< Hu(i-e) exp I - eu{i-e)m'_' I 



(1/4)/ 1 



l=lo 
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A;=0 

(lnd>l) 

< c„exp{-c^ln^n}. 
This proves fl6.6ip . It follows that 
(6.63) 

Cl{^u)\ > mL^^^^' for some y G /31dn), /q <l<h 

Prov. 16.31 ( ^ 

< cn^l Qlii-e) [K\ > m^-' ^ for some / > /q] + c^n-^f^ \ < c^n-^ . 

The above bound, together with (16.591) . allows us to conclude that for all n > c^, 
P [x is C3-proper under ^u] > > 0. Summing this probability over the vertices 
X with tx{B{x,3/3\dn)) = (which have positive proportion by Lemma [6?T|) we 
obtain that 

(6.64) E[|{x G : X is ca-proper under ^u}\] > c^n. 

We now show that the number of Cs-proper points concentrates around its expec- 
tation. To this end we use a concentration inequality in |McD89j . Lemma 1.2. We 
first consider a slightly modified configuration ^ G {0, 1}^, where we consider only 
the first [2n"'] jumps of each Y^: 

(6.65) e = e(r°,F\...,r^^"-^) = l{^\ U ^[o„.„.,(y-)AL]}, 

i<Mu 

where Tk{Y'^) is the time of the k-th jump of (we set Tk{Yi) = L iiY^ jumps less 
than /c-times). We define a function 

(6.66) /(F°, . . . , r*^""i) = I {x G \^ : X is cg-proper under |. 

We claim that, writing Y for (y°, . . . , y*^"-i), 
(6.67) 

if Y and Y' differ in at most one coordinate, then \ f{Y) — f{Y')\ < crC^^ . 

Indeed, changing one segment F*, we can change at most 2rC values of ^. Moreover, 
the event that a given point x G is ca-proper under ^ only depends on the values 
of in -B(x, /3 Idn), which has volume bounded by cra^. This gives (16.671) . Note that 

E[/] > E[/l^^=|] = E[|{x G 1/ : X is cs-proper under ■ 1^^ J 
(6-68) jg^U 



> Cun-n- P\E,u ^ i], for n > Cu- 
The bound (I4.10p implies that 

(6.69) P[e„ ^ el < M^P[Nl > 2n^] < exp{-c„n^}. 
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Hence, we have that E[/] > Cufi for n > c'^. Setting t = ^CuU, with the same constant 
as in the lower bound on E[/], we obtain 

P[\{x eV: is C3-proper under ^Jl < t] < P[i, ^ ^] + P[E[/] - f > t] 
< exp{-c„n^} + 2exp{-cV-i+^-2{7+/3)|^ 

where we have used Lemma 1.2 in |McD89j . together with fl6.67p . in the last in- 
equality. Since 1 — 7 — 2/3 > 1 — 3/9 > 0, this estimate is more than enough to 
imply fl6.57p for appropriately chosen constants C4 and c. This concludes the proof 
of Lemma 16.81 □ 

6.3.2. Robustness of proper sites. In this sub-section we prove that the components 
around /i-proper sites (as in Definition 16. 7p are really robust with respect to pertur- 
bation. Observe that the following lemma is completely deterministic. 

Lemma 6.9. Let f3, e he as in f l6.9p . (16.1 ip and define the class S of configurations 
m {0,1}^, 

(6.71) E = [t] e ■.\{x eV : X is C3-proper}| > c^n^. 

Let rj ^ 'E and rj' G {0, 1}^ be such that r]'{z) 7^ r]{z) for at most n^~'^ Ivl* n vertices 
z eV. Then there exists a constant c{ai,u), such that 

(6.72) \{xeV : Kiri')\ > cn'^"<^+^>''}| > cn. 

Proof. In this proof, we use the word "proper" to mean "ca-proper under t]" and 
use m+, m_, f+ and V- to abbreviate mu{i+e) , 'n^u(i-e), Vu{i+t) and Vu{i-e)- We will 
use the term string to refer to a self-avoiding path on V with length li = [/3 Id n\ , 
as in f l6.56p . For e S, we are going to choose a particular collection of strings, 
which will be contained in supp?7, as follows. First, we take a collection of proper 
vertices H = {xi, . . . , a;Lc4nj} ^ V, according to some pre-defined order. Again using 
some arbitrary order, for each / < \_C4^n\ , we insert into F^ [can^+'^J distinct strings 
starting at G H and contained in supp?]. Such a collection exists due to f l6.56p (i) 
(see also f l6.55p ). Denoting by |F^| the number of strings in F,,, we have 

(6.73) |F^| = [C4nj ■ [can^'+'^J. 
Since for all / < [c4nj, B{xi,2f3logn) has tree excess zero, 

(6.74) every string in F^ is uniquely determined by its end-points. 

Let Sy be the number of strings in F^ intersecting y. We claim that, for any given 

(6.75) Sy < epilog ny'- ■ni"-'^. 

To show this claim, observe that the fact that the starting point of every string in 
F^ is proper together (16.561) (ii) imply that if there is a string intersecting y, then 

(6.76) l^y('7)l ^ m^^'*'" for every integer / G [/oj^i]- 

We bound Sy by splitting the set of strings intersecting y in the following way: 

h 

(6.77) Sy = ^{strings in F,, intersecting y and starting at distance / from y}. 

1=0 



GIANT COMPONENT LEFT BY RANDOM WALK 36 

Since the strings are contained in supp?], using (16 .74^ . for n > Cu,we obtain 
(6.78) Sy <f2K{v)\K-'{v)\ < 2XilCS(^)l|Ci-'(r/)| +2X:iCj(r/)||Cj-'(r^)|. 

1=0 1=0 l=lo 

Using (16 .76 p . the bound \Cy \ < c{d — 1)'^ for k < Iq, and Iq < c„lnlnn, we get 



li/2 

(6.79) Sy < Culnlnn ■ {d - l)'omL^/^^'^ + 2 

1=0 



«o_(5/4)/i j_ ^ X^^(5/4)/^(5/4)ai-0_ 



From (16. 8p . it follows that m'i < n"-^. Hence, 

li/2 

(6.80) Sy < Cuilnny'-n-i"'^ + 2^nt^-^ < c„(lnn)'=" 

1=0 

This proves (I6.75p . 

Our next step is to show that there exists such that 

(6.81) for n > c^, at least half of the strings of are contained in suppyy'. 



Indeed, we know that i]'{z) ^ ri{z) for at most '^In n vertices z & V. This, 
together with (16. 75 p . implies that at most Cu{\TLnY^n^~'^^i'"-^ strings in F^ are 
not contained in suppr^'. Since 7 = f+/3/2 (cf. (16. lip ), we obtain by (I6.10p that 
1 — 7 + < 1 + Therefore, due to (I6.73p . for n > cj^, at least half of the 

strings in F^ are contained in suppr/'. This gives us (I6.8ip . 

Let us recall that in the construction of the set F^, we have chosen a collection 11 
of [c4nj proper vertices in V , and for each of these vertices, we have picked [csn^+^J 
strings starting at Xi. We claim that 

, , for n > Cu, at least Ly/iJ of the vertices in 11 have 

^ ' ■' at least ^71"+^ of their strings contained in supp?]'. 

Indeed, otherwise the number of strings in F^ contained in supp t]' would be bounded 
by 

ta or>\ ^'^ v+S I '^'^4 C3 C3C4 1+^ 

(6.83) —n ■ c^n +^ + —z-n ■ —n +^ < ——n 

8 8 8 4 

contradicting (I6.8ip for n > Cu- 

Since distinct strings starting on a vertex in 11 have distinct end points, each 
vertex x as in (I6.82p satisfies |Ca;(?7')l — ^'n'"+'^. Choosing c as we deduce 

(I6.72P for n > Cu- By possibly decreasing c = c„ in such a way that \_cn\ = for the 
other finitely many values of n, we obtain Lemma 16.91 and thus complete the proof 
of Proposition 16.61 □ 



6.3.3. Mesoscopic components for With Lemmas 16.81 and 16. 9[ we have all tools 
to finish the proof of Proposition 16.61 stating the existence of many mesoscopic 
components of the complement of the segments and the long-range bridges. 
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Proof of Proposition \6.6[ The configurations and differ only on vertices visited 
by the bridges. Hence, setting V = \{z E V : C,u{z) ^ ^U-^)}!' denoting by A^*'-' the 
number of jumps of the bridge Z*'-' , for n > e"*, and using < Cun^~"' ■, we obtain 



(6.84) 



P[V> n^~^ In^ n] < P ^ ^ A^*'-'' > c^n^"^ In^ 

i<Mu,j<\Dn 



n 



<Py IJ {A^*'^' > In^njJ < c^expj-cln^n}, 

where we have used (14. lip from Lemma 14.21 in the last inequahty. By possibly 
increasing Cu we conclude that the equation above holds for every n > 1. Finally, 
taking c as in Lemma 16.91 using Lemma 16.91 



(6.85) 



cn 



< P 



|{x : X is Cs-proper under < c^n + P [V > '''In^n] . 



By Lemma 16.81 and (16.841) the last expression is bounded by c„e' 



The proof of 



Proposition 16.61 is then finished by choosing the constants ci and C2 appropriately. 

□ 



6.4. Proof of Theorem 11.21 and sprinkling. We can now approach the second 
part of the proof of Theorem II. 2^ that is the sprinkling construction. 



Proof of Theorem Let u be as in the theorem and choose 6 > such that 

1 



(6.86) 
Set Un and u' as 
(6.87) 



25 < 1 



200 



1I6TTI1 

< 



7(m). 



u 



Ur 



{u + n~^) A m'. 



Throughout this proof, we write e and 7 for e = e(u') and 7 = 7(u') as in (16. 9p and 
(16.111) with u replaced by u' . 

The strategy of this proof is the following: we apply Proposition 16.61 to ^'^ as 
defined in (I6.53p . This will show that with high probability, there are at least cin 
vertices in components of volume Cin^"'(i+')^ in supp^^^ ^ supp^^,. In what we 
call the sprinkling construction, we then erase some of the y"s in the definition 
of E,'u„i and thereby increase the configuration to a new configuration By 
construction, the sprinkled configuration will be close in distribution to the 
vacant set left by the random walk trajectory X[o,„„]. Moreover, we will prove that 
with high probability some of the components of supp^^^^ will merge and form a 
component of size pn as we increase ^'^^ to thus proving Theorem 11.21 

We divide the proof into the following three steps: in the first step, we construct 
the sprinkled configuration and reduce Theorem 11.21 to an estimate on In 
the second step, we apply Proposition 16.61 to prove that the original configuration 
is sufficiently well-behaved. In the third and final step, we deduce that with 
high probability, supp has a component with volume at least pn and conclude. 

Step 1: The sprinkling construction. For the sprinkling construction, we use 
an auxiliary probability space ({0, 1}^"", Q), for M^^ defined in (I6.13p . Under 
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the measure Q, the canonical coordinates {Rk)a<k<Mu„ are i.i.d. BernouUi random 
variables with parameter 

(6.88) q = n^^\ 

Recall that the configuration was defined in fl6.53p as the indicator function of the 
set of vertices not visited by and Z*'-' for i < M^^ and j < [In n\ , constructed on 
a suitable probability space {Q, P). On the probability space (f2 x {0, 1}^^"" , P ® Q), 
we will now construct from the sprinkled configuration roughly according 
the following procedure: first, we remove all F^'s such that Ri = 1. If possible, we 
then construct a trajectory by linking the remaining segments with the bridges Z^'\ 
see Figure [3] for a sketch. 



Figure 3. Schematic illustration the choice performed by ip. The 
horizontal lines are the trajectories of the segments and the arcs 
are the trajectories of the bridges Z^'^ . The top picture illustrates the 
configuration The sprinkled configuration obtained from the 
realization of the Bernoulli random variables Ri associated to Y^ is on 
the bottom picture. 

For the precise construction, we define the increasing random sequence ki of indices 
k < for which R^ = 0; the sub-indices i here run from to the random variable 

(6.89) I=\{k:0<k<Mu^,Rk = 0}\. 

We now construct a function that concatenates the F^'s with Ri = 0, i < Mu„, 
and some of the bridges Z^'^ into an element of D{[0, un], V), for u as in the theorem. 
There are two situations in which this construction fails. First, if J < M„, then there 
are not enough segments left. Second, if fcj+i — ki > [InnJ for some i < I, there 
is no bridge connecting Y''' to y'^'+i. Let us hence refer to the intersection of the 
complements of these events as the good event Q, 

(6.90) g = {I> Mu} n {ki+i -h< [\nn\ for all 1 < z < M„} C {0, 1}^^"". 

Letting d be some arbitrary constant trajectory of length un, we now define tp : 
n X {0,1}^"- ^ D{[Q,un],V) as 

(6.91) 

9, on f2 X Q'^, and otherwise: 



lD([0,'un],V)' 

where |D([o,nn],v) denotes the restriction to D{[0,un], V) and X is the concatenation 
mapping defined in f l4.2p (Here we abuse the notation slightly. The mapping X 
takes infinite number of arguments, however since un < Mu{L + i) the restriction 
to D{[0,un], V) does not depend on the arguments which we do not specify). The 
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sprinkled configuration is then defined as the indicator function of the vacant set 
left by the concatenated trajectory, 

(6.92) r^ = l{\/\X[o,„„]}o^, 

where we have used the notation X for the canonical coordinate process on the space 
Di[0,un],V). 

By construction, we then have supp^^^ C supp^^^ on Q x Q. Moreover, con- 
ditionally on Q X Q, the concatenation ip is distributed according to the piecewise 
independent measure Q'^"' defined in Section SJ 

Let us now see that the event Q is indeed typical. The random variable / is 
binomially distributed with expectation M„^(l — q) = + n^~'^~^(l + o(l)). With 
the help of a Chernoff bound (see Lemma 1.1 in [McD89j ). we find that 

(6.93) Q[I < M„] < exp(-c„,5r22(i-^-^)/M„J < e^^{-Cu,sn^-'<-^'). 
Together with a simple union bound, this implies that 

(6.94) Q[g'] <Q[3i<I: h+i -h> [\nn\] + Q[I < MJ 

< c«n^~^gLinnJ +gxp(-c„5r^^"^"^^) 



< c^n^-^-^-^Li-J ^ exp(-c„,5ri'-^^'') < c^,a,sn-'' 



In particular, the distribution ° (P ® Q) is close to Q. Indeed, for any J^un- 
measurable event A, we have (^""[A] = P (S) Q['?/'^"'^(/l)|fi x Q], and therefore using 
an easy calculation 

(6.95) |Q™(A) - (P ® Q){^-\A))\ < Q{g') < Cu^^^sn-". 

Thanks to this estimate, we know that the sprinkled configuration E,^'^ is close in 
distribution to the vacant set left by a trajectory under the piecewise independent 
measure Q"". Together with Lemma I4?T| we can now reduce our task to proving the 
estimate in Theorem 11.21 for the configuration left by We set 

(6.96) p = Ci/100, 



where ci was defined in Proposition 16.61 By Lemma 14. H 

(6.97) P[K,J < pn] < g™0^max(l{^\X[o,.„]})| < pn] + exp{-c„ In^ n}, 
which by fl6.95p implies that 

(6.98) ^OCaxI <pn]<P® QO^max(r^)l < pu] + 0.,^^"^ 

It is therefore sufficient to show that 



(6.99) p® Q \CmUen\<pn 



Step 2: ^'^^^ is well-behaved with high probability. In the second step, we apply 
previous estimates in order to deduce that has the properties we will use to 
show that a component of size pn appears in supp 

Let Ci, C2, . . . be the connected components of supp ^'^^ ordered according to their 
volume, Ci being the largest component, and define the random variable k G NU{oo} 
as the smallest integer such that 

(6.100) |Ci| + ■■■ + |C«| > cin = lOOpn, 



GIANT COMPONENT LEFT BY RANDOM WALK 



40 



provided such an integer exists, and k = oo otherwise. Note that if supp contains 
many large clusters (in the sense of Proposition 16. 6p . then k is small. More precisely, 
we have the following event inclusion, 
(6.101) 

{|{a; e V : \C^iCJ\ > Cin''"'(i+^)^}| > cm} C {k < [n^-''"'(i+^)^] } =: JC C n. 



Hence, Proposition 16.61 (and monotonicity of ^') is more than enough to imply that 

(6.102) P[/C'] < Cu,an~''. 

We further define an event S G Q a.s the event that the numbers of jumps A^* of all 
y"s and the total length A^*'-' of all Z*'-^'s appearing in the construction of ^'^^ do 
not exceed their expected value too significantly: 

(6.103) S = {N' < 2n^, < M„ J n | ^ A^''^' < In^n ■ n^"^}. 

i<Mun j<lnn 

By Lemma 14.21 we know that 

(6.104) P[S'] < Cu,.n-''. 

The estimates fl6.102p and fl6.104p will allow us to prove the required estimate fl6.99p 
in the last step by considering only configurations satisfying the properties in 

Kns. 

Step 3: has a large component with high probability. Finally, we prove the 
required estimate fl6.99p by showing that the random deletion of F^'s in the con- 
struction of does make a component of size pn appear with high probability. 

We define an event V as 

{There exists a partition of {1, . . . , k} into sets A and B,' 
such that A = UaeACa is not connected to i3 = UbesCb in 
supp^"*^, and both |^| and \B\ are larger than lOpn. 

In analogy with the proof of Proposition 3.1 in |ABS04j . we claim that 

(6.106) {\CmUCn\ < P^} n{K<oo}CV. 

To see this, we consider the equivalence relation ~ on the set {1, . . . , k} given by 
j ~ j' if and only if Cj is connected to Cj' in supp^*^. Then every equivalence 
class corresponds to one component of supp In particular, if all components of 
supp^'^^ are smaller than pn, then the sum of \Cj \ for all j's in the same equivalence 
class must also be smaller than pn. So, we can partition the set {1,...,k} into 
sets A and B in such a way that equivalent indices belong to the same set and 
I J2a€A - J2b€B \^b\\ < ^pn. Siucc Y,j l^jl > lOOpn, we obtain that J2a€A l^^l 
and J2beB \^b\ > lOpn, and, by construction of the equivalence relation ~, Ua^A^a 
is not connected to UbesCb through supp^*^. This shows (16.1061) . 

For subsets F and F' of V, we use the notation F Si F' to denote the event 

(6.107) {F and F' are not connected in supp^'^-^}. 
By fl6306|) and /C C {k < oo}, 

(6.108) P ^ahc^Ueni < pn] < P[S'] + P[/C1 + E[l^nsQ[V]], 
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where we have used that S,IC C ^ and Fubini's theorem. For the sake of clarity, 
let us recall that Q is a measure on {0, 1}^^"" and emphasize that the Q-probability 
in this last expression is computed for y* and Z^'^ fixed. On /C, there are at most 

21"' " 'I ways to partition {1, . . . , into A, B. Hence, on /C, using the union 
bound, 

(6.109) Q[V] < 2r«"""'<^+^^'l sup Q[A^B], 

where the supremum is taken over all partitions of {1, . . . , k} as in (16.1051) and A, 
B are defined in (16.1051) . too. By increasing the range of the supremum we deduce 
that the following estimate holds uniformly on the event /C, 

(6.110) < 2r"'~""'''+'''l sup Q[F^F% 

F,F'CV:\F\,\F'\>Wpn 

We will now find a bound on the event on the right-hand side, valid uniformly on 
the event S. To this end, fix and Z^'^ such that S holds, as well as subsets F and 
F' of V containing at least lOpn vertices. Using the expansion property of the graph 
G, see (12. lip , and the Max-fiow Min-cut Theorem, we can find a collection of at 
least c^n disjoint paths in V joining the sets F and F' for some constant C5 > 0. We 
call these paths connections. Since, on S, J2i<Mu smaller or equal to 2u^,n (see 
(I4.10p and (I6.103P ). we can extract from this collection a sub-collection C such that 
\C\ = ^c^n, and such that all connections in C intersect at most [4m^/c5J segments 
Y\ 

We next want to prove that with high probability, at least one of these connec- 
tions only intersects F^'s that do not appear in ^'^p, using again the concentration 
inequality from |McD89j . Lemma 1.2. To this end, we define the function g by 

(6.111) g{Ri,...,RM^J = \{CeC : for all i, either C n = or i?^ = 1}|. 

The probability that all of the at most [4tij,/c5j y"s intersecting a given ( ^ C have 
an index i with i?j = 1 is at least gl-^*'*/^5J. So, for some cq = CQ{ai, a2,u), cj > 0, 

(6.112) E%] > ic5gL4"*/'^^Jn =: 20671^-"'^. 

Changing one segment can change the value of g by at most 2n'' on S. Therefore, 
by Lemma 1.2 in |McD89] . 

_ 2 2(l-cr<5) 

(6.113) Q[,<cen^-i<2exp{ }. 

If > cqti^''^''^, then there are at least c^n^''^''^ disjoint connections in C linking 
F and F' and only using vertices in supp^*^. In accordance with (I6.87p . we now 
choose S such that 2c-r6 < 7. Then, since on S the total length of the bridges 
X^jj-ZV*'-' < In^n ■ n^""^, the events {g > cgn^"'^'"'^} and S imply that at least one 
connection in C is contained in supp.^"*^, for n > Cu,&. Hence, (I6.113p implies that 
uniformly on the event 5, 

(6.114) sup <2exp(-c^n^-^^2c75^)_ 

F,F'(ZV:\F\,\F'\>10pn 
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Inserting this estimate into fl6.110p . noting that 



(6.115) l-7-2c75> 1-27^ 



1 — Vu'{l+e)l3, 



we find that, uniformly on /C fl 5, 
(6.116) Q[V] < exp{~Cu,5n 



1-27 



Using this estimate, together with the bound fl6.102p on P[/C'^] and the bound (16.1041) 
on P[S^], in fl6.108p . we find fl6.99p for > 1 by possibly adjusting the constants. 



This section contains the proof of Theorem II. 3[ that is of the uniqueness of the 
giant component. More precisely, we show that for any choice of /t > and m < m^, 
with a high probability, the second largest component of the vacant set V" is smaller 
than Kn. The sprinkling is again the major ingredient of the proof. This time, 
however, we will really use the fact that u„ — u < n~^. 

Heuristically, our argument runs as follows. We will show that any component of 
of size at least nn should contain at least Kn/2 vertices that were included in 
clusters of size at least n'^"^/^ of the vacant set left by segments at level u„. Hence, 
in order to have |Csec| > i^n, there should be two groups of such vertices which do 
not get connected after the sprinkling. A small extension of the proof of the last 
section then shows that this happens with a small probability. 

Proof of Theorem \1.3i We choose 7, /3 as in (16. lip and recall from (I6.87P the notation 
Un = {u + n~^) A{{u + Ui,)/2), where 26 < 1— 7. By decreasing 6, we can also assume 
that (5 < 7. Recall also that the vacant set left by segments was defined in (I6.14p 
(with Un replaced by u). 
We divide the vertices of G into three sets. The vertex x E V is called small, if 



(7.1) tx(S(x,5/31dn)) = 0, < Id'n, and C,(e„J C B{x,(3\dn). 



tx{B{x,5p\dn)) = and (i), (ii) of (I6.56P hold with h = C3 of Lemma [6.8[ 

It is called bad otherwise. We will use B to denote the set of bad vertices. 

The next lemma shows that the set B is small. The lemma should be viewed as an 
analogue to a non-existence of intermediate components in the Bernoulli percolation 
case. 

Lemma 7.1. There exists a function g{n) such that \im.n^ao din) / n = and 



We postpone the proof of the Lemma 17.11 and proceed with the proof of The- 
orem 11.31 First, we add long-range bridges to the configuration ^u„, that is we 
define as in (I6.53p . We must be careful to see that these bridges do not destroy 
the components of proper vertices in supp^^/. To this end we collect a family F of 
strings (that is of self-avoiding paths of length [/3 Id n\ ) as in the proof of Lemma 




□ 



7. Uniqueness of the giant component 




(7.3) 



lim P[\B\ > g{n)] = 0. 
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This collection contains [0371.'^^+ J distinct strings starting at x for any proper ver- 
tex x, as before. In particular, this implies that |r| < C3n^+^'"+. We can show, 
as below fl6.75p . that the number 5*^, of strings intersecting a given y & V satisfies 

Sy < CM(log^" n)n4:'" . Let now S be the event that all segments and bridges are not 
too long, as in (16.1031) . On 5, the number of strings that are intersected by bridges is 

thus at most Cu(log'^" n)ri'~"'^'i'"~^ . We declare a proper vertex had proper, it at least 
half of the strings starting at this vertex is intersected by the bridges. Obviously, on 

(S, the set BV of bad proper vertices must satisfy \BV\^n^'"+ < Cu(log'^" n)n^~'^~^i^~'^. 
With flOOj) and f l6TT|) . it follows that 

(7.4) \BV\ = o{n), as ri — )■ 00, on S. 

On the other hand, the remaining proper vertices, that we call large, are starting 
vertices of at least ^n'"+^ strings which are not intersected by the bridges. Hence, if 
X is large, it is contained in a component of of size at least |n^+'^. This implies 
that 

on S, the number of components of supp C,'^^ that contain a large 

vertex is at most cn^~^^+. 
We now perform the sprinkling as in Section [HI Recall that on the probability 
space ({0, 1}*^"", Q) we have defined i.i.d. random variables (-Rfc)o<fc<M„„ with suc- 
cess probability q = n"^"^ (cf. (I6.88P ). the number of remaining segments / (cf. (I6.89P ) 
and the good event Q (cf. (I6.90p ). We have then constructed the sprinkled configu- 
ration (cf. (I6.92p ). Using Lemma [4. ![ then the estimate (I6.95p . as n — t- 00, 

(7.6) 

P[\C:J > nn] < Q--[\C:J > Kn] + 0(1) 

<P(^Q[\Csec{en\>^n,g,S,\B\<g{n)]+V[g''] + P[S'^] + P[\B\>g{n)] + o{l) 

lOit . 1 16. 1041 1. Il73t 

< P ® Q[\Csoc{Cn\ > ^n,g,S,{\B\<gin)}] + o{l). 

In order to estimate the term on the right-hand side, we claim that for any k > 0, 

X on iS n ^ n < 5'(n)}, any component of supp^'*^ of size > nn 

' contains at least Kn/2 large vertices, for n > Ck,u,5, 

Indeed, recall that we have divided the vertices in supp^^^ into small, bad, bad 
proper and large vertices. The vertices in supp^**^ consist of these four sets, and 
the set {x E V : ^u^{x) = 0,^'*^(x) = 1}. Let us call all the vertices in this last set 
sprinkled vertices. Suppose now that the event S H Q H {\B\ < g{n)} occurs. Then 
by definition of S and Q, the number of sprinkled vertices is at most 

(7.8) (M„„ - I)2n^ + cn^-^ In^ n < (M„„ - M„)2n^ + cn^"^ In^ n < Cu,5n^'^ 

because Mu„ — < Cu^sti^^'''^^ and 5 < 7. Consider now any component A of size 
nn of supp^**^. Then the number of vertices in A that are either bad, bad proper 
or sprinkled is at most \B\ + \BV\ + Cu,sn^~^ = o(n), by definition of g{n) and fl7.4p . 
the remaining vertices being either small or large. By definition of small vertex, all 
small vertices in supp belong to components of size at most Id^ n. Any of the at 
most Cu^sn^'^ sprinkled vertices can merge at most {d — 1) such components, so the 
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maximum number of small vertices belonging to the same component of supp is 
bounded by Cu^si^d^ n)n^~^ , which is less than o(n), too. Hence, for n > Ck,u,5, at 
least Kn/2 of the vertices in A are large, proving (17. 7p . 

Let C be the set of large vertices. Defining V to be the event 

(7.9) V = {there are A, B C C such that \A\ > Kn/2, \B\ > Kn/2, and A ^ B}, 
we obtain from (17. 7p . for n > Ck,u,s, 

(7.10) P ^ Q[\C,ec{en\ > ^n,g,S,{\B\ < g{n)}] <P(^Q[V',S] = E[lsQ[V% 

which is essentially equivalent to the right-hand side of (I6.108p . Note also that since 
supp^^„ C supp^^P, V equals 

{there are A, B C C such that \A\,\B\> Kn/2, and U.^a C,(CJ ^ UbesCbiCuJ}, 

We should now bound the number of possible choices for the unions in the equation 
above. By (17.51) . there are at most n^'^^+ sets of the form Cx{^u^) with x E C, so 

there are at most 2^"^ choices for the unions in the equation above. Hence, 

(7.11) E[l5Q[P']] < 22""'"+ sup sup Q[F ^ F']. 

S F,F'cV:\F\,\F'\>nn/2 

Repeating the argument from (I6.110p to (I6.114p and choosing 6 small enough, we 
infer that the right-hand side of (17. lip tends to zero as n tends to infinity. With 
(17. 6p and (I7.10p . this completes the proof of Theorem 11.31 □ 



We now prove the lemma we used in previous proof. Due to Proposition 16.31 this 
proof will be reduced to estimates on a branching process. 



Proof of Lemma \ 7. 1\ Let a; G be an arbitrary vertex and let B^ = B(x, /Sldri), 
B'^ = B{x,5f3\dn). We write f+ = V- = Vu{i~.e), see (16. 8p for the notation. 

It is easy to see that B C Bi U ■ ■ ■ U 84^, where 



(7.12) 



Bi 


= {x 


e V 


tx(i?i 


B2 


= {x 


G V 




B3 


= {x 


G V 


tx(i?; 


B, 


= {x 


G V 


tx(S; 



By Lemma [6. II we have < g{n) for a sequence g which decays as in the state- 
ment, deterministically. Further, by (16. 63 p . we have P[x G B^] < cn~'^. Therefore, 
using the Markov inequality, there is a sequence g{n) such that g{n)/n tends to zero, 
such that lim„_j,oo PllB^l > g{n)] = 0. 

It remains to control B2, B3. We choose any e > small enough such that (16.90 
is satisfied, noting that these two constraints allow us to make e > even smaller. 
For n > cs^u, we then have Un G (m,m(1 + e/4)). Hence, the random sets 
constructed in Proposition 16.31 (with e replaced by e/2) dominate the component 
Cy{lB-^u„) from above and from below with probability at least 1 — Cu,s,en~^'^ , for n > 
Cu,s,e- Recall also that C"'^^^^/^) are distributed as Co under Qu(i±e/2)- 
branching process description of C'^^^~^^/'^\ that is = \{y G C'^^^~^^^'^\dist{x,y) = 
k}\. Similarly, let Z'^ be such a description of C"*^^"^/^^ By our choice of parameters 
and Lemma [6. 2 [ both and Z~ are supercritical branching processes. We use T+, 
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T_ to denote their extinction times, (^+, 0_ their offspring generating function, and 
g+, g_ their extinction probabihties. Observe that 

(7.13) hmg_ — g+ = 0. 

Set r = [f3\dn\, Nn = c^n^'"+ . Using Proposition 16.31 we get 

(7.14) P[xe B2] - c^,s,en~''' < P[Z; < Nn, Z+ > 1] 

< P[i < z; < Nn] + P[o = z; < z+]. 

Since f. is strictly decreasing, Nn/n^^<^+^/'^'> 0. Hence, the first term on the right- 
hand side is the probabihty that the branching process is not extinct at generation 
r, but is much smaller than its typical size n''''"(i+«/2) . This probability tends to as 
n — )■ 00, using e.g. Theorems 6.1, 6.2 in Chapter I, p. 9, of |AN72j . Using the fact 
that the generating function of Z^ is the r-th iteration (f)±.^ of 0, we get that 



(7.15) 



limsup P[0 = Z7 < Z+] = limsup0^'^(O) - 0^'^(O) = q- - q+. 



Here we have used that q± is the attractive fixed-point of (j)±. Using fl7.13p . this 
can be made arbitrarily small by choosing e small. Inserting this back into fl7.14p . 
we get lim„_j.oo P[x G B2] = 0. This implies, using the Markov inequality again, that 
lim„_j.oo P[|'B2| > g{n)] = for some g as in the statement. 
Similarly we have. 



P[x e B3 



cn 



-2/3 



< P 



z; <Nn,J2^k >'^d^n 



k=0 



(7.16) <P[Z;<Nn,T. 



00 



P[^Z+ > ld'n,T_ < 00 



k=0 



< P[i < z; < Nn] + p[ Vz+ > id'n,r+ <oo\+ P[r_ <t+ = oo 



k=0 



The first probability on the right-hand side tends to 0, as in the previous argument. 
By [AN72j . Theorem 12.3 in Chapter I, p. 52, conditioned on T+ < 00, has the 
law of a sub-critical branching process. Using this claim it is easy to show that 
the second probability in fl7.16p tends to zero. The third probability can be made 
arbitrarily small by choosing e small, by using fl7.13p again. This then implies that 
lim„_^oo Pll-Bsl > gin)] = for an appropriately chosen g, as in the previous case. 
This completes the proof of the lemma. □ 
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